Foreword


AS WE go about our daily business, we make fre- 
quent use of whole numbers, like 1, 2, 3, and 4, which arise 
when we count collections of objects. We also use fractions, 
like 1/2 and 1/4, which arise when measurements are made.
These are obviously two different kinds of number, because 
fractions can never arise as a result of counting alone. In 
what sense then, do they both deserve to be called num- 
bers? When we say that the fraction 4/2 is equal to the 
whole number 2, what does this mean? How can a number 
of one kind be equal to a number of a totally different kind? 

These are among the questions which modern mathe- 
matics has explored, and for which it has found answers. 
Here are others: 

We sometimes have occasion to use a number like the 
square-root of two. It seems like a very elusive number that 
is reluctant to show its face. Either it hides shyly behind a 
symbol like √2 or it reveals itself only piecemeal as a deci-
mal, 1.4, 1.41, 1.414, in a process that never ends. Why
doesn’t it behave itself and settle down like a decent 
ordinary fraction with a definite numerator and denomi- 
nator that can determine its value once and for all? 

In elementary algebra we were introduced to negative 
numbers, and taught such mysterious rules as that the 
product of two negative numbers is a positive number. 
Where does such a rule come from? 

The electrical engineer uses the number √-1 in the
equations that describe the behavior of an alternating cur-
rent. This number is called “imaginary,” yet there is noth-
ing imaginary about the electrical current it helps to
describe. It is a genuine number, mathematicians assure us, 
although it is not real. What is the meaning of this para- 
dox? 

These are some of the questions that we shall look into 
in the course of this book as we take a close look at the 
numbers of everyday life. We shall find the answers in the 
fact that our number system has not been static, but has
been growing, while our conception of what constitutes a 
number has changed. As we trace this growth, we shall dis- 
cover the familiar roots of the unfamiliar concepts and 
terms of modern mathematics. 

For mathematics, one of the oldest of the sciences, is 
growing with the vigor and vitality of youth. It is con- 
stantly expanding into new areas of investigation, and 
works with new concepts that are the fruit of a century-old 
revolution in mathematical thinking. Associated with the 
new ideas is a new vocabulary that gives modern mathe-
matical writing its characteristic flavor. To the mathema-
tician, the new ideas expressed by the new words serve as
a bright light that penetrates to the core of a problem and 
helps him see and understand. To the layman, the new vo- 
cabulary is often an opaque screen behind which things are 
going on that he feels he cannot hope to understand. The
purpose of this book is to remove that screen, by introduc- 
ing the reader to the meaning of some of the basic ideas of 
modern mathematics. 

This book is addressed to the average reader who is curi- 
ous about the new developments in mathematics. It is not 
a refresher course in high school mathematics. It is not a 
rehash of old ideas, but an introduction to new ideas, tradi- 
tionally presented only to the specialist, in advanced 
mathematics courses on the college senior or graduate level. 
But, although the ideas are advanced, the presentation is 
elementary. Anybody who has had high school algebra and 
geometry will be able to understand and enjoy this book. 

A typical text in advanced mathematics today bristles
with such terms as group, ring, field, homomorphism, iso-
morphism, and homeomorphism. These unfamiliar looking
words make it seem as though mathematics has abandoned 
its old subject matter, and is no longer concerned with 
the study of numbers and space. This, of course, is not true. 
Numbers and space are still very much at the heart of 
mathematics. The new ideas and terms have arisen in con- 
nection with a more penetrating analysis of their properties. 

Underlying the terms group, ring, and field, for exam- 
ple, are the old, familiar, simple operations of addition, sub- 
traction, multiplication, and division. The mathematician 
has discovered that these operations are not the exclusive 
property of numbers alone. So he studies them in their most 
general form, in order to discover rules that will be valid 
in any context in which the operations are performed. 

The outlook of the modern mathematician is indicated 
in his frequent use of the word stem “morph,” meaning 
form, as in the words homomorphism, isomorphism, and 
homeomorphism. The mathematician sees the number sys- 
tem as a complex of interrelated structures. He studies 
these structures separately, and in their relationships to 
each other. The exploration of these structures has revealed 
that we have, not a number system, but number systems; 
not algebra, but algebras; not geometry, but geometries; not
space, but spaces. While the properties of numbers and 
space have been generalized, the subject matter of mathe- 
matics has been pluralized. 

The central thread around which the book is organized 
is the expansion of the number system, from natural num- 
bers to integers to rational numbers to real numbers to 
complex numbers. Although this sequence of steps in the 
development of the number system parallels very roughly 
historical stages in the development of the concept of num- 
ber, the organization of the book is not chronological or 
historical. It is a logical organization from the modern point 
of view, showing how the various number systems are re- 
lated to each other. The development outlined here might 
be referred to as “operation bootstrap.” The system of
natural numbers (the whole numbers used for counting)
has defects that limit its usefulness. The story presented 
here shows how mathematics has lifted itself by its own 
bootstraps, using the defective system of natural numbers 
to construct bigger and better number systems that elimi- 
nate the defects. 

At each stage of the construction of the expanded num- 
ber systems, we encounter some of the structures such as 
groups, rings and fields that receive so much attention in 
modern mathematics. These modern concepts are intro- 
duced first by means of familiar examples in the number 
systems, and then other less familiar examples are given, 
too. As he reads the sections devoted to these modern con- 
cepts, the reader will be aware of the fact that he is merely 
nibbling at the corner of a great rug that has a beautiful but 
intricate design woven into it. If what he sees from the 
corner arouses his curiosity about the main design, it is 
hoped that he will satisfy this curiosity by systematic 
study from some of the standard text-books. A bibliography 
is given at the end of the book. 

To get the most value and enjoyment out of this book, 
read it with pencil and paper in hand. Verify the steps of 
each argument, work through all examples given, and do 
other examples like them. A “Do It Yourself” section at 
the end of each chapter gives you an opportunity to 
strengthen through use your understanding of the new ideas 
you will acquire in the book.


THE NEW MATHEMATICS





CHAPTER I 


Numbers for Counting 


WE ARE thoroughly familiar with the faces of the 
people with whom we live. Yet we are rarely conscious of 
the details of their features. If, as we look at a familiar face, 
we do take particular notice of the details, such as a curve 
of the lip, or a line in the forehead, it seems as though we 
are seeing them for the first time. Then, seeing these features 
that we never notice, we suddenly have the feeling that we 
are looking at the face of a stranger. We shall have a similar 
experience with the familiar numbers of everyday life. When 
we use these numbers, we take advantage of certain prop- 
erties that they have. However, we are so accustomed to 
these properties that we are hardly aware of them as we 
use them. We shall now take particular notice of these 
properties, and list them explicitly. Looking at the familiar 
features of ordinary numbers, we shall see the strange new 
face of modern mathematics. 

The first numbers we all learn to use are those we need 
to answer the question, “How many?” They are the num- 
bers 1, 2, 3, 4, 5, and so on. There is an endless chain of these 
numbers. We use them for counting, and we perform cal- 
culations with them, such as addition and multiplication. 
Let us take a close look at these simple acts. 

Counting 

Suppose that on Tuesday evening you want to see how 
many days are left till the end of the week. It is likely 
that you will take count in this way: You will call off the 
names of the days, Wednesday, Thursday, Friday, and Sat-
urday, and, for each day that you name, you will turn down
one finger on your right hand. After completing the list of
days, you find that you have turned down all the fingers on 
your right hand except the thumb. So you conclude that 
there are four days left till the end of the week. We find
hidden in this procedure three important mathematical 
concepts: the idea of a mapping, the idea of a one-to-one 
correspondence, and the idea of cardinal number. 

A mapping is a matching operation between two sets of 
objects: to each member of one set a member of the other 
set is assigned as partner. The two sets in this case are the 
set of days being counted, and the set of fingers on your 
hand. You set up a mapping when you single out a finger to 
turn down for each day you count. The mapping might be 
summarized in the following table: 

Wednesday → little finger
Thursday  → ring finger
Friday    → middle finger
Saturday  → index finger

The arrowheads indicate that the mapping has a direction. 
You select a finger for each day you name. This is not the 
same as selecting a day for each finger. To specify the di- 
rection of the mapping, we say that it is a mapping of the 
set of days named into the set of fingers. We refer to 
the finger into which a day is mapped as its image under the 
mapping. 

Another mapping is shown in the diagram below. In this 
mapping, a set of names of people has been mapped into 
the set of whole numbers from 20 to 24 by assigning to each 
name the person’s age in years:

[Diagram: an arrow runs from each name to an age. John → 20,
Mary → 23, Susan → 24; the arrows from Richard and from
William both end at the same number.]


This mapping differs from the other one in one important 
respect. The two names, Richard and William, are both
mapped into the same number. This is an example of a
many-to-one mapping, in which a single object may be the 
image of more than one object. In the mapping of days into 
fingers, however, no two days were mapped into the same 
finger. This is an example of a one-to-one mapping, in which 
each object is the image of at most one object. 
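
The two kinds of mapping described above can be sketched in Python, using a dictionary to assign an image to each object. This is only an illustration: the helper name is_one_to_one is made up for this sketch, and the shared age of 21 for Richard and William is an assumption chosen for the example.

```python
# A mapping as a dictionary: each key (an object) is assigned one image.
days_to_fingers = {
    "Wednesday": "little finger",
    "Thursday": "ring finger",
    "Friday": "middle finger",
    "Saturday": "index finger",
}

# Assumption: Richard and William are both given the age 21 for illustration.
names_to_ages = {"John": 20, "Richard": 21, "William": 21, "Mary": 23, "Susan": 24}


def is_one_to_one(mapping):
    """Return True if no two objects share the same image."""
    images = list(mapping.values())
    return len(images) == len(set(images))


print(is_one_to_one(days_to_fingers))  # True: a one-to-one mapping
print(is_one_to_one(names_to_ages))    # False: a many-to-one mapping
```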

In the mapping of the set of days into the fingers of the 
right hand, one of the fingers, the thumb, wasn’t used at 
all. For this reason the mapping of the set of days into the 
set of fingers on the right hand is not reversible. If we try 
to reverse it, we find that there is no mapping of the thumb 
into a day. We do not consider it a mapping then, because 
a mapping should provide an image for each object in the 
set on which the mapping is performed. However, if we 
consider only the set of fingers turned down, then the map- 
ping is reversible. Then, while each day named has a sepa- 
rate finger as its image, in the reverse mapping, each finger 
has a separate day as its image. In this case we say that the 
two sets are in one-to-one correspondence. Two sets are in 
one-to-one correspondence when there is a reversible map- 
ping that assigns each member of one set to one and only one 
partner in the other. The diagram below, using double-
headed arrows, shows the one-to-one correspondence be-
tween the set of days and the set of fingers turned down:

Wednesday ↔ little finger
Thursday  ↔ ring finger
Friday    ↔ middle finger
Saturday  ↔ index finger
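
As a rough sketch of what reversing a mapping involves, the Python fragment below swaps each day with its finger. The function name reverse and the list right_hand are assumptions introduced for this example.

```python
days_to_fingers = {
    "Wednesday": "little finger",
    "Thursday": "ring finger",
    "Friday": "middle finger",
    "Saturday": "index finger",
}
right_hand = ["thumb", "index finger", "middle finger", "ring finger", "little finger"]


def reverse(mapping):
    """Swap each object with its image; this only works for a one-to-one mapping."""
    reversed_mapping = {image: obj for obj, image in mapping.items()}
    if len(reversed_mapping) != len(mapping):
        raise ValueError("mapping is not one-to-one, so it cannot be reversed")
    return reversed_mapping


fingers_to_days = reverse(days_to_fingers)

# On the whole right hand the reverse fails to be a mapping: the thumb has no image.
unmapped = [finger for finger in right_hand if finger not in fingers_to_days]
print(unmapped)  # ['thumb']

# Restricted to the fingers turned down, every finger has exactly one day.
turned_down = set(days_to_fingers.values())
print(set(fingers_to_days) == turned_down)  # True
```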

When two sets can be put into one-to-one correspondence 
by some mapping, we say that they contain the same num- 
ber of objects, or have the same cardinal number. All sets 
that have the same cardinal number can be put into one-to- 
one correspondence with each other. They form a family 
of sets associated with that cardinal number. Each cardinal 
number has its own family of sets. For example, sets con- 
sisting of single objects only belong to the family of sets 
associated with the number we call one. Sets of pairs of
objects belong to the family of sets associated with the num-
ber we call two. Sets of triples belong to the family of sets
associated with the number we call three, and so on. 

Any set we ever deal with belongs to one of these families. 
When we ask the question, “How many objects are there 
in the set?”, it is really like asking, “Which family of sets 
does it belong to?” To answer the question, we follow this 
procedure: We pick one set from each family, and use it as 
a standard set for making comparisons. We match the set 
we are interested in against these standard sets, until we 
find one with which it can be put into one-to-one corre- 
spondence. In this way we identify the family of sets that 
it belongs to, and the cardinal number associated with that 
family. This is precisely what you do when you match days 
against fingers. You use the set consisting of your little 
finger alone as a standard set to represent the number one. 
You use the set consisting of little finger and ring finger as a 
standard set to represent the number two. You use the set 
consisting of little finger, ring finger, and middle finger as 
a standard set to represent the number three. The set con- 
sisting of little finger, ring finger, middle finger, and index 
finger is your standard set representing the number four.
That is why you drew the conclusion, in this case, that 
there are four more days to the end of the week. 

On other occasions, we use a method of counting that is 
more sophisticated but is essentially the same. We count 
out four objects by saying to ourselves, “one, two, three, 
four.” As we count, we set up a one-to-one correspondence 
between the objects we are counting and sets of spoken 
number-names. The first object is matched with the set 
consisting of the single word, “one.” The first two are 
matched with the set consisting of the words, “one, two.” 
The first three are matched with the set consisting of the 
words, “one, two, three.” And so on. By using the number 
names of the numbers in order of size, we keep enlarging the 
standard set step by step. When the count ends, we know 
that the last number-name used is the cardinal number of
the last standard set against which we matched the objects
we are counting. So it is also the cardinal number of the 
counted objects. By using standard sets made up of number 
names arranged in order we telescope a whole series of 
matching operations into one, and end up with the answer 
to the question, “How many?” 
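
Counting by number-names, as just described, might be sketched in Python as follows. The list NUMBER_NAMES and the function count are names invented for this illustration, and the list is cut off at seven only to keep the example short.

```python
# The standard sets are built from number-names taken in order of size.
NUMBER_NAMES = ["one", "two", "three", "four", "five", "six", "seven"]


def count(objects):
    """Match each object with the next number-name; return the last name used."""
    names = iter(NUMBER_NAMES)
    last_name = None
    for _ in objects:
        # Each step enlarges the standard set by one more number-name.
        last_name = next(names)
    return last_name


days_left = ["Wednesday", "Thursday", "Friday", "Saturday"]
print(count(days_left))  # four
```

The function never asks how long the list is. It only matches objects against names one at a time, which is the telescoped series of matching operations described above.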

Addition 

A typical problem in addition is to find the sum of the 
numbers 2 and 3. The meaning of this problem can be re- 
stated in terms of sets of objects in this way: Suppose you 
have one set of objects whose cardinal number is 2, and 
another set of objects, different from those in the first set, 
and whose cardinal number is 3. A larger set is formed 
when these two sets are united. What is the cardinal num- 
ber of the united set? 

We can answer this question by actually forming such a 
united set, and then identifying the standard set with which 
it can be put into one-to-one correspondence. This is the 
procedure of the beginner in arithmetic, who first turns 
down two fingers, then turns down three more fingers, and 
finally matches the set of turned down fingers against the 
standard set consisting of the spoken words, “one, two, 
three, four, five.” However, experienced calculators use a 
short-cut for getting the answer. Having carried out the 
process of uniting and counting sets of objects many times 
before, we record the results in an addition table which we 
memorize. Then, any time we want to find the sum of two 
numbers, we don’t have to manipulate sets of objects again. 
We merely consult the table. 
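
The two routes to a sum, uniting sets and consulting a table, can be compared in a short Python sketch. The names add_by_uniting, standard_set, and addition_table are assumptions made for this example, and the table is limited to the numbers 1 through 9.

```python
def add_by_uniting(first, second):
    """Unite two sets that share no objects and count the united set."""
    if first & second:
        raise ValueError("the two sets must have no objects in common")
    return len(first | second)


def standard_set(label, size):
    """Hypothetical helper: a set of `size` distinct labelled tokens."""
    return {(label, i) for i in range(size)}


# The beginner's method: actually form the united set.
print(add_by_uniting(standard_set("left", 2), standard_set("right", 3)))  # 5

# The short-cut: carry out the process once for each pair and record the results.
addition_table = {
    (a, b): add_by_uniting(standard_set("left", a), standard_set("right", b))
    for a in range(1, 10)
    for b in range(1, 10)
}
print(addition_table[(2, 3)])  # 5, found by lookup alone
```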

Using the addition table instead of adding on our fingers 
is more than just a time-saving convenience. It is an act of 
abstraction that has changed the meaning of addition. When 
we add on our fingers, we are actually working with cardinal 
numbers, which are properties of sets of objects. When we 
use the addition table, we are performing an operation on 
abstract symbols. This operation can now be performed
without regard to what the symbols stood for in the first
place. Let us examine this abstract operation a little more 
closely. 

We have a set of symbols, 1, 2, 3, 4, and so on, that we 
call numbers. If we pick any one of them, and then pick 
another one, the addition table assigns to this pair of num- 
bers another number called its sum. For example, if we 
pick the number 2 first, and the number 3 next, the table 
assigns as their sum the number 5. We have picked the pair 
of numbers 2 and 3 in a definite order, specifying that the 
2 is first and the 3 is second. So we can refer to it as the
ordered pair (2, 3). What the addition table does is assign 
a definite number called the sum to every ordered pair of 
numbers. When we describe it in this way, we recognize 
that addition is a mapping. It maps the set of ordered pairs 
of numbers into the set of single numbers. We usually show 
the mapping in a series of statements like this: 2 + 3 = 5, 
2 + 4 = 6, 2 + 5 = 7, etc. Its nature as a mapping shows 
up more clearly if we write it this way: 

         +
(2, 3) ----> 5
         +
(2, 4) ----> 6
         +
(2, 5) ----> 7
etc.

The plus sign printed over the arrows is a reminder that the 
mapping shown is the special mapping called addition. It is 
only one of a multitude of possible mappings. We could, if 
we wished, set up a mapping of pairs of numbers into the 
letters of the alphabet. If we wished, we could map pairs of 
numbers into people’s names. We have a wide choice of 
mappings, because we can map any set of objects into any 
other set of objects in any way we please. The plus sign 
symbolizing addition calls our attention to the fact that it 
is a very special mapping with special characteristics. Now 
let us examine some of these characteristics.

We observe, first, that the numbers we are pairing off in
our ordered pairs are selected from the list of symbols, 1, 2, 
3, 4, etc. To distinguish this set of symbols from the cardinal 
numbers from which they were derived, let us give it a 
name. We shall call it the system of natural numbers or 
counting numbers. We observe next that the number as- 
signed as sum to each ordered pair is selected from the same 
set, the system of natural numbers. A mapping which as- 
signs to each ordered pair of objects in a set another object 
selected from the same set is called a binary operation. So 
addition is an example of a binary operation defined on the 
system of natural numbers. 
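
One way to picture addition as a mapping of ordered pairs is the Python sketch below, which also builds a second mapping of the same pairs into letters for contrast. The names is_natural, images_are_natural, and into_letters are assumptions for this example, and the sample is limited to the numbers 1 through 5.

```python
import string


def is_natural(x):
    """True for the counting numbers 1, 2, 3, ... (Python ints, excluding bools)."""
    return isinstance(x, int) and not isinstance(x, bool) and x >= 1


sample = range(1, 6)  # a small finite sample of natural numbers

# Addition: every ordered pair (a, b) is assigned its sum.
addition = {(a, b): a + b for a in sample for b in sample}

# Another mapping of the same ordered pairs, this time into letters.
into_letters = {
    (a, b): string.ascii_lowercase[a + b] for a in sample for b in sample
}


def images_are_natural(mapping):
    """Check that every image is drawn from the natural numbers."""
    return all(is_natural(image) for image in mapping.values())


print(addition[(2, 3)])                  # 5
print(images_are_natural(addition))      # True: images come from the same system
print(images_are_natural(into_letters))  # False: images lie outside the system
```

Both dictionaries are mappings of ordered pairs, but only the first takes its images from the set the pairs were drawn from, which is what makes it a binary operation.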

If we consult the addition table, we find that 1 + 4 = 5,
2 + 3 = 5, 3 + 2 = 5, and 4 + 1 = 5. The different ordered 
pairs, (1, 4), (2, 3), (3, 2), and (4, 1) are all mapped into 
the same image, 5. So addition is a many-to-one mapping. 
In particular, an ordered pair like (2, 3) and the pair (3, 2), 
obtained by having the 2 and 3 change places, both have 
the same image. We could write 2 + 3 = 3 + 2. A similar 
statement is true for the sum of every ordered pair of natural 
numbers. We find that 5 + 2 = 2 + 5, 9 + 16 = 16 + 9, 
etc. This characteristic of the addition of natural numbers 
can be summarized in the following rule: If the letter a 
stands for any natural number, and the letter b stands for 
any natural number, then a + b = b + a. That is, if the 
natural numbers being added commute or change places, 
the sum is still the same. So this rule is known as the com- 
mutative law of addition, and we say that addition of 
natural numbers is a commutative operation. 

We are so accustomed to using the commutative law of 
addition that it may seem to be obvious, and hardly worth 
mentioning. But it needs special mention because, while 
some binary operations, like addition of natural numbers, 
obey a commutative law, there are others that do not. For 
example, one of the operations we learned in elementary 
school arithmetic is called division, and is denoted by the 
symbol ÷. It is not a commutative operation, because the
numbers being divided cannot, in general, change places
without changing the result. For example, 8 ÷ 2 is not equal
to 2 ÷ 8.
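
The contrast between addition and division can be checked on a finite sample with a few lines of Python. The function is_commutative is a name made up for this sketch, and a check on the numbers 1 through 20 is evidence for the law, not a proof of it.

```python
from fractions import Fraction
from itertools import product


def is_commutative(operation, sample):
    """Check a op b == b op a for every ordered pair drawn from the sample."""
    return all(
        operation(a, b) == operation(b, a) for a, b in product(sample, repeat=2)
    )


sample = range(1, 21)

print(is_commutative(lambda a, b: a + b, sample))           # True
print(is_commutative(lambda a, b: Fraction(a, b), sample))  # False

# The example from the text: 8 ÷ 2 and 2 ÷ 8 differ.
print(Fraction(8, 2), Fraction(2, 8))  # 4 1/4
```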

Addition, as we have talked about it so far, is an opera- 
tion performed on a pair of numbers. We can also extend 
it to three numbers. We can add three numbers by first 
adding two of them, and then adding the sum to the third. 
However, for three numbers like 2, 3, and 7, listed in a defi- 
nite order, we have a choice of two ways of doing it. We 
might add the sum of 2 and 3 to 7, or we might add 2 to 
the sum of 3 and 7. These two possibilities can be written 
down in this form: (2 + 3) + 7, and 2 + (3 + 7). In this 
notation, the parentheses indicate which sum is to be found 
first. When we carry out these additions, we find that it 
doesn’t make any difference which sum is found first, be-
cause the results come out the same: (2 + 3) + 7 = 5 +
7 = 12, and 2 + (3 + 7) = 2 + 10 = 12. This is a charac-
teristic of the addition of three natural numbers, no matter
what numbers are used. It is expressed in the rule, (a + b)
+ c = a + (b + c), where a, b, and c stand for any natural
numbers. This rule says that in the first step of the ad- 
dition we are free to associate the middle number either 
with the number on the left or with the number on the 
right. So the rule is known as the associative law of addi- 
tion, and we say that addition of natural numbers is an as- 
sociative operation. Since it makes no difference which pair 
of numbers we add first, we may as well leave out the 
parentheses altogether, and write the sum of a, b, and c as 
a + b + c, where it is understood that a + b + c = a +
(b + c) = (a + b) + c.

The associative law, too, deserves special mention be- 
cause it is a special property of addition of natural num- 
bers, which it shares with some binary operations but not 
with all. For example, suppose we use the symbol av to des- 
ignate the operation “take the average of.” It is a binary 
operation that can be performed on the familiar whole num-
bers and fractions that we use every day. In this notation,
8 av 16 means the average of 8 and 16, which is 12. The
symbol 12 av 12 means the average of 12 and 12, which is
also 12. The symbol 16 av 12 means 14, and the symbol 
8 av 14 means 11. This operation does not obey an as- 
sociative law, because (8 av 16) av 12 is not equal to 8 av 
(16 av 12). In fact, (8 av 16) av 12 means 12 av 12, or 12, 
while 8 av (16 av 12) means 8 av 14, or 11. 
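
The averaging example can be replayed in Python. The function av below mirrors the symbol used in the text; is_associative is a helper name assumed for this sketch, and its check covers only a small sample of numbers.

```python
from fractions import Fraction
from itertools import product


def av(a, b):
    """Take the average of a and b, kept exact as a fraction."""
    return (Fraction(a) + Fraction(b)) / 2


def is_associative(operation, sample):
    """Check (a op b) op c == a op (b op c) for every triple from the sample."""
    return all(
        operation(operation(a, b), c) == operation(a, operation(b, c))
        for a, b, c in product(sample, repeat=3)
    )


print(av(av(8, 16), 12))  # 12
print(av(8, av(16, 12)))  # 11

sample = range(1, 8)
print(is_associative(lambda a, b: a + b, sample))  # True
print(is_associative(av, sample))                  # False
```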

By a step-by-step process, the use of the commutative 
law and the associative law for addition of natural numbers 
can be extended into a general rule for the sum of any 
finite selection of natural numbers: When you add a finite 
selection of natural numbers, you can list them in any 
order, and group them as you please. The sum will always 
come out the same. 

Multiplication 

The meaning of multiplication of natural numbers, like 
the meaning of addition, can be stated first in terms of 
sets of objects. To multiply 2 times 3, we set up a rectangu-
lar array of objects, consisting of two rows, with three ob-
jects in each row:

    • • •
    • • •

Then we find the cardinal number of this set. In general,
to multiply the numbers a and b, we
find the cardinal number of a set consisting of a rows with 
b objects in each row. The answer is called the product of 
a and b, and is designated by a • b, where we use a dot as 
the symbol for multiplication. Once we have found the 
product of two natural numbers, we can record it for future 
reference in a multiplication table. Then we can separate 
the operation of multiplication from its original meaning 
of finding the cardinal number of a rectangular array of 
objects. We can think of it instead as merely a mapping 
of ordered pairs of natural numbers into the system of natu-
ral numbers. We usually show the mapping by a series of
statements like this: 2 • 3 = 6, 2 • 4 = 8, 2 • 5 = 10, etc.
However, as in the case of addition, we can express it with
the help of arrows:

         •
(2, 3) ----> 6
         •
(2, 4) ----> 8
         •
(2, 5) ----> 10
etc.

Since the mapping is defined for every ordered pair of 
natural numbers, and the image under the mapping is al- 
ways a natural number, multiplication, like addition, is a 
binary operation on the system of natural numbers. We 
know from our experience with multiplication of natural 
numbers that 2 • 3 = 3 • 2, 2 • 4 = 4 • 2, 2 • 5 = 5 • 2, etc.
In general, if a and b are any natural numbers, a • b =
b • a. This is known as the commutative law of multiplica-
tion. Multiplication, like addition, also obeys an associative
law: a • (b • c) = (a • b) • c. This is seen, for example, in
the fact that 2 • (3 • 5) = 2 • 15 = 30, and (2 • 3) • 5 =
6 • 5 = 30. Because of this law, we can write the product
of three numbers without parentheses, and give it a defi-
nite meaning: a • b • c = a • (b • c) = (a • b) • c. Com-
bining the commutative law and associative law of multi-
plication leads to the general rule: when you multiply a
finite selection of natural numbers, you can list them in any 
order, and group them as you please. The product will al- 
ways come out the same. 
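
The rectangular-array meaning of multiplication lends itself to a small Python sketch. The function multiply_by_array is a name invented here; it represents each object in the array by its (row, column) position, which is an assumption of this illustration.

```python
from itertools import product


def multiply_by_array(a, b):
    """Build an array of a rows with b objects in each row, then count it."""
    array = {(row, column) for row in range(a) for column in range(b)}
    return len(array)


print(multiply_by_array(2, 3))  # 6

sample = range(1, 7)

# Commutative law on a small sample: a • b = b • a.
print(all(
    multiply_by_array(a, b) == multiply_by_array(b, a)
    for a, b in product(sample, repeat=2)
))  # True

# Associative law on the same sample: a • (b • c) = (a • b) • c.
print(all(
    multiply_by_array(a, multiply_by_array(b, c))
    == multiply_by_array(multiply_by_array(a, b), c)
    for a, b, c in product(sample, repeat=3)
))  # True
```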

There is one more characteristic of the multiplication 
of natural numbers that links it with addition. We can 
understand it best by going back to the original meaning of 
multiplication as finding the cardinal number of a rectangu- 
lar array, and the original meaning of addition as finding 
the cardinal number of a united set. Printed below is a 
rectangular array of stars, consisting of three rows with 
nine stars in each row. The number of stars in this rectangu-
lar array is 3 • 9. Since a row of nine stars can be thought
of as the first five stars united with four other stars, we
can write 9 = 5 + 4. So the number of stars in the rectangu-
lar array can also be written as 3 • (5 + 4). Now, suppose
we move the first five stars in each row over to the left, so 
that a space separates them from the rest of the stars in 
the same row. The effect is to split our rectangle into two 
rectangles. One rectangle has three rows with five stars in 
each row, so it contains 3 • 5 stars. The other rectangle has 
three rows with four stars in each row, so it contains 3 • 4 
stars. Since we get the original rectangle by uniting the two 
smaller rectangles, the number of stars in the original 
rectangle is the sum of the numbers of stars in the two 
smaller rectangles. This fact is expressed in the statement 
that 3 • (5 + 4) = (3 • 5) + (3 • 4):

    *********            *****                 ****
    *********   equals   *****   united with   ****
    *********            *****                 ****

We can verify the correctness of the statement by noting that
3 • 9 = 27, and 15 + 12 = 27. In general, if a, b, and c stand
for natural numbers, a • (b + c) = (a • b) + (a • c). Similarly, (b +
c) • a = (b • a) + (c • a). This rule is known as the distrib- 
utive law and expresses the fact that multiplication is distrib- 
utive with respect to addition. That is, the multiplier can be 
distributed among the individual terms in the expression it 
multiplies. In the statement of this law, multiplication and 
addition cannot change places. While 3 + (5 • 4) = 3 + 
20 = 23, (3 + 5) • (3 + 4) = 8 • 7 = 56, so that addition is 
not distributive with respect to multiplication. It is cus-
tomary, in writing an expression like (a • b) + (a • c) to
leave the parentheses out, so that it looks like this: a • b + 
a • c. In such an expression, which gives instructions for 
doing both multiplication and addition of some numbers, it 
is understood that the multiplications must be done first. 
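
The splitting of the rectangle of stars can be imitated in Python by treating each star as a (row, column) position. The variable names below are assumptions chosen for this sketch.

```python
rows, left_width, right_width = 3, 5, 4

# The original rectangle: three rows with nine stars in each row.
whole = {(r, c) for r in range(rows) for c in range(left_width + right_width)}

# Move the first five stars of each row aside to split the rectangle in two.
left_part = {(r, c) for (r, c) in whole if c < left_width}
right_part = whole - left_part

print(len(whole), len(left_part), len(right_part))  # 27 15 12

# Multiplication is distributive with respect to addition ...
print(3 * (5 + 4) == 3 * 5 + 3 * 4)  # True

# ... but addition is not distributive with respect to multiplication.
print(3 + (5 * 4), (3 + 5) * (3 + 4))  # 23 56
```

The last two lines also rely on the convention stated above: in 3 * 5 + 3 * 4 Python, like ordinary notation, does the multiplications first.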

The Five Laws 

We originally introduced the natural numbers as symbols 
for the cardinal numbers. Then we made these observa- 
tions about them: There are two binary operations defined 
on the natural number system, and we call them addition
and multiplication. The properties of these operations are
embodied in the addition and multiplication tables. By ex- 
amining these tables, we found five laws that are obeyed 
by the natural number system: the commutative and as- 
sociative laws of addition, the commutative and associative 
laws of multiplication, and the distributive law which as- 
serts that multiplication is distributive with respect to 
addition. These laws have a special significance in the de- 
velopment of our notion of what a number is. We find that 
when we carry out computations with numbers we do not 
have to keep in mind their original meaning as cardinal 
numbers. It is enough to think of them as abstract symbols 
related to each other by addition and multiplication tables 
that obey these five laws. This fact suggests that we re- 
define numbers as follows: A number system is any collec- 
tion of objects on which two binary operations called 
addition and multiplication are defined, such that addition 
is commutative and associative, multiplication is commuta- 
tive and associative, and multiplication is distributive with 
respect to addition. 

This definition is a declaration of independence for the 
idea of a number system. It frees it from its cardinal- 
number ancestry and permits it to lead its own life. It 
allows it to expand and grow. When a number system is
defined in this way, we find that there is not just one num- 
ber system, but many number systems. We find, too, that 
it is possible for one number system to be part of a larger 
number system, which in turn is part of a still larger num- 
ber system, and so on. In fact, the core of this book is the 
systematic construction of larger and larger number sys- 
tems, using the natural numbers as a foundation. At each 
stage of the construction we shall recognize that we have 
built a number system when we find that it has two binary 
operations that obey the five laws: 

  I. a + b = b + a

 II. (a + b) + c = a + (b + c)

III. a • b = b • a

 IV. (a • b) • c = a • (b • c)

  V. a • (b + c) = a • b + a • c
     or (b + c) • a = b • a + c • a
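
A test for the five laws can be sketched in Python as a single function that accepts any collection of objects and any two binary operations. The name obeys_five_laws is an assumption of this example, and because it tries only a finite sample, a result of True is supporting evidence and not a proof.

```python
from itertools import product


def obeys_five_laws(elements, add, mul):
    """Check laws I through V for every triple drawn from a finite sample."""
    return all(
        add(a, b) == add(b, a)                              # I
        and add(add(a, b), c) == add(a, add(b, c))          # II
        and mul(a, b) == mul(b, a)                          # III
        and mul(mul(a, b), c) == mul(a, mul(b, c))          # IV
        and mul(a, add(b, c)) == add(mul(a, b), mul(a, c))  # V
        for a, b, c in product(elements, repeat=3)
    )


def plus(a, b):
    return a + b


def times(a, b):
    return a * b


sample = range(1, 8)

print(obeys_five_laws(sample, plus, times))  # True

# With the two operations interchanged, law V fails, as the text observed.
print(obeys_five_laws(sample, times, plus))  # False
```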

Large and Small Numbers 

The natural number system has some other important 
characteristics besides the five laws. One of these is that 
we can compare any two numbers in it for size. The number 
5 is larger than 4, and 4 in turn is larger than 3. The notion 
of larger and smaller is derived from addition in this way: 
We say that b is larger than a if b is equal to the sum of a 
and some other natural number. For example, 5 is larger 
than 4, because 5 = 4 + 1; 5 is larger than 3, because 
5 = 3 + 2. 


One System with Many Disguises 

There are many different ways of writing the natural 
numbers. In the system of Arabic numerals that we use 
every day, the numbers one, two, three, four and five are 
written as 1, 2, 3, 4, 5. In Roman numerals, still used on 
clock faces and monuments, they are written as I, II, III, 
IV, V. In Hebrew they are written as the first five letters 
of the alphabet. If we think of these different systems of 
numerals as symbols for the cardinal numbers, then they 
are different ways of representing one and the same num- 
ber system. However, we may also think of each system of 
numerals as a separate number system in its own right, 
with addition and multiplication defined by its addition and 
multiplication tables. The Arabic, the Roman, and the 
Hebrew numerals could then be referred to legitimately as 
three separate number systems. But they are number sys- 
tems that can be used interchangeably, so, although they 
are separate systems, they are still somehow the same. In 
the next chapter we shall encounter number systems that 
are not interchangeable and may not be considered the 
same. In order to recognize when number systems are inter- 
changeable, and when they are not, we have to define what 
we mean when we say different systems are the same.
What we have in mind is that they have the same structure.
For two number systems to have the same structure, each 
number in one system must have a counterpart in the 
other system. We can express this requirement in technical 
language by saying that there must be a mapping of one 
system into the other that places them in one-to-one cor- 
respondence. But the one-to-one correspondence alone is 
not enough. We want to be sure, too, that the results of 
computations in one system will correspond to the results 
of computations in the other system. So we say that two 
number systems have the same structure, or are isomorphic, 
if (1) there is a mapping of one into the other that puts 
them into one-to-one correspondence, and (2) under this 
mapping, sums and products are preserved. The require- 
ment can also be stated in this way: Under the mapping, 
each element in one system has an image in the other. 
Moreover, the image of the sum of two numbers is the sum 
of the images; and the image of the product is the product 
of the images. Comparing Arabic numerals and Roman 
numerals, for example, we can set up a one-to-one cor- 
respondence, shown in part in this table: 

1 ↔ I
2 ↔ II
3 ↔ III
4 ↔ IV
5 ↔ V
6 ↔ VI


Each system has its own addition and multiplication tables, 
part of which is shown in the customary square arrange- 
ments below: 


Addition

 +  |  1   2   3          +   |  I    II   III
----+-------------      ------+----------------
 1  |  2   3   4          I   |  II   III  IV
 2  |  3   4   5          II  |  III  IV   V
 3  |  4   5   6          III |  IV   V    VI

Multiplication

 •  |  1   2   3          •   |  I    II   III
----+-------------      ------+----------------
 1  |  1   2   3          I   |  I    II   III
 2  |  2   4   6          II  |  II   IV   VI
 3  |  3   6   9          III |  III  VI   IX


Under the mapping the image of 2 is II, and the image of 3 
is III. The sum of 2 and 3 is 5. The sum of II and III is 
V, which is the image of 5. So the sum of the images is the 
image of the sum. The product of 2 and 3 is 6. The product 
of II and III is VI, which is the image of 6. So the prod- 
uct of the images is the image of the product. Arabic nu- 
merals and Roman numerals, considered as separate num- 
ber systems, are isomorphic to each other. Although 
numbers in one system look different from numbers in the 
other system, the relationships within the systems, as ex- 
pressed in the addition and multiplication tables, have the 
same structure. So the two systems are really only one 
structure appearing in two different styles of dress. 
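
The two conditions for an isomorphism can be checked by machine for the part of the tables shown in the text. The Python sketch below is an added illustration; the names to_roman, roman_add, roman_mul, and preserves_operations are our own assumptions. The Roman tables are copied from the text as lookup tables, so that the Roman system is treated as a separate system with its own operations.

```python
from itertools import product

# Part of the one-to-one correspondence. The pairs 1 to 6 come from the
# correspondence table; 9 <-> IX appears in the multiplication table.
to_roman = {1: "I", 2: "II", 3: "III", 4: "IV", 5: "V", 6: "VI", 9: "IX"}

# Roman addition and multiplication tables for I, II, III.
roman_add = {
    ("I", "I"): "II",    ("I", "II"): "III",  ("I", "III"): "IV",
    ("II", "I"): "III",  ("II", "II"): "IV",  ("II", "III"): "V",
    ("III", "I"): "IV",  ("III", "II"): "V",  ("III", "III"): "VI",
}
roman_mul = {
    ("I", "I"): "I",     ("I", "II"): "II",   ("I", "III"): "III",
    ("II", "I"): "II",   ("II", "II"): "IV",  ("II", "III"): "VI",
    ("III", "I"): "III", ("III", "II"): "VI", ("III", "III"): "IX",
}

def is_one_to_one(mapping):
    """No two numbers may share the same image."""
    return len(set(mapping.values())) == len(mapping)

def preserves_operations(numbers):
    """Image of the sum is the sum of the images; likewise for products."""
    for a, b in product(numbers, repeat=2):
        images = (to_roman[a], to_roman[b])
        if to_roman[a + b] != roman_add[images]:
            return False
        if to_roman[a * b] != roman_mul[images]:
            return False
    return True

print(is_one_to_one(to_roman))
print(preserves_operations([1, 2, 3]))
```

Both checks succeed for the portion of the tables shown, which is all that this sketch can confirm.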

Zero 

Arabic numerals displaced all others because of their 
great convenience. They are most convenient to use because
they give us a way of writing indefinitely many numbers
while using only a small number of symbols called
digits. This feat is accomplished by attaching different
meanings to the same digit. In the number 111, three 1’s
are used, and each has a different meaning. The 1 on the 
extreme right stands for the number one. The 1 in the 
second column from the right stands for the number ten, 
and the 1 in the third column stands for the number one 
hundred. The symbol stands for the sum of one, ten, and 
one hundred. Because the meaning of a digit depends on 
its position in the written numeral, we say the Arabic sys- 
tem of numerals is a place value system. To represent 
three hundreds plus two tens plus five ones, we write 325.
Now suppose we want to write the symbol for three tens.
We put a 3 into the second column from the right. But we
won’t recognize it as the second column unless we write 
something down in the first column. This makes it neces- 
sary to think of three tens as three tens plus no ones, and 
to introduce a symbol to represent the absence of ones. We 
use the symbol 0 for this purpose, and call it zero. The 
concept of a symbol representing none was first con- 
ceived by the Hindus, and was later taken over by the Arabs 
and built into their system of numerals. At first 0 was only 
a symbol representing none. Then it was promoted to full 
citizenship in the community of numbers by the introduc- 
tion of rules for addition of zero and of multiplication by 
zero that are consistent with the five laws obeyed by the 
system of natural numbers. The rules are these: zero plus 
any number gives that number again; and zero times any
number gives zero. The first of these rules can be written in
symbols as follows:

0 + x = x; x + 0 = x;

where x may be any counting number or 0. The second rule
can be written in symbols like this:

0 • x = 0; x • 0 = 0;

where x may be any counting number or 0.

When 0 is adjoined to the system of counting numbers in 
this way, the result is an expanded number system called the 
system of whole numbers. (That is, 0, 1, 2, 3, 4, . . . are 
called whole numbers, but only 1, 2, 3, 4, . . . are called 
counting numbers or natural numbers.) The reader can 
easily verify that when addition of zero and multiplication 
by zero are defined by the rules given above, the system of 
whole numbers obeys the five laws given on pages 24 and 25. 
For example, since 0 + 3 = 3, and 3 + 0 = 3, it follows that 
0 + 3 = 3 + 0, so that the commutative law of addition is 
obeyed in this instance, when the numbers being added are 
0 and 3. 
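
The place value idea described earlier in this section, and the role that 0 plays in it, can be sketched in a few lines of Python. This is an added illustration, and the function name place_value is our own assumption. Each digit is multiplied by a power of ten fixed by its column, counting columns from the right.

```python
def place_value(numeral):
    """Value of an Arabic numeral given as a string of digits."""
    total = 0
    # Column 0 is the extreme right (ones), column 1 is tens, and so on.
    for column, digit in enumerate(reversed(numeral)):
        total += int(digit) * 10 ** column
    return total

print(place_value("111"))  # one hundred plus ten plus one
print(place_value("325"))  # three hundreds plus two tens plus five ones
print(place_value("30"))   # three tens plus no ones
print(place_value("3"))    # without the 0, the 3 sits in the ones column
```

The last two calls show why a symbol for "none" is needed: only the 0 in the first column pushes the 3 into the tens column.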

In later chapters, we shall be building up some other
number systems. It will be necessary for us to find out
whether these number systems contain an element that behaves
like the zero of the system of whole numbers. In our
haves like the zero of the system of whole numbers. In our 
search for a zero element, we shall use the first rule for com- 
puting with zero as our criterion. If a number system con- 
tains a number a such that a + x = x, and x + a = x for all 
numbers x in that system, then we shall call a a zero ele- 
ment. 

One 

The distinguishing feature of a zero element is that add- 
ing it to another number leaves that number unchanged. 
There is a whole number that has the same relationship to 
multiplication that zero has to addition. This number is the 
number one, which obeys the rule: 1 • x = x, and x • 1 = x 
for all whole numbers x. That is, multiplying a number by 1 
leaves that number unchanged. In the number systems we 
explore later, we shall sometimes find an element that has 
this property, and shall call it a unity element. 

Identity Elements 

When we write out sums, we always use the symbol + 
to stand for the operation of addition. We could, if we 
wished, use some other symbol instead, as long as we agreed 
on its meaning. We might, for example, use the symbol * 
to represent the operation. In that case, the characteristic 
property of 0 could be written in this way: 0 * x = x, and
x * 0 = x. In the same way, we could, if we wished, change 
the symbol for multiplication. If, temporarily, we used the 
symbol * to represent multiplication, then the characteristic 
property of 1 would be written as follows: 1 * x = x, and
x * 1 = x. 

The similarity in form of these two statements emphasizes
the fact that 0 and 1 really both have the same property,
except that each has it in relation to a different operation.
They are both examples of what is known as an identity
element. In any system in which a binary operation is
defined, and is symbolized by *, if there is an element e 
that has the property e * x = x, and x * e = x, for all values 
x in the system, then e is called an identity element. The 
letter e is used in this definition of an identity element because
it is the initial letter of the German word Einheit,
which means unity. 

Now we can state more precisely how the terms zero 
element and unity element are used in mathematics today. 
Whenever a binary operation is denoted by the symbol + 
and is called “plus,” the identity element for that operation 
is called a zero element, and is denoted by 0. Whenever a 
binary operation is denoted by the symbol •, and is called 
“times,” the identity element for that operation is called a 
unity element and is denoted by 1. We shall use this con- 
vention many times in later chapters. 
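
The definition of an identity element lends itself to a direct search when the system is finite. The Python sketch below is an added illustration; the name find_identity and the sample ranges are our own assumptions. It looks for an element e with e * x = x and x * e = x for every x in the sample.

```python
def find_identity(elements, op):
    """Return e with op(e, x) == x == op(x, e) for all x, or None."""
    elements = list(elements)
    for e in elements:
        if all(op(e, x) == x and op(x, e) == x for x in elements):
            return e
    return None

whole_sample = range(0, 10)     # 0, 1, ..., 9
counting_sample = range(1, 10)  # 1, 2, ..., 9

print(find_identity(whole_sample, lambda a, b: a + b))     # zero element
print(find_identity(whole_sample, lambda a, b: a * b))     # unity element
print(find_identity(counting_sample, lambda a, b: a + b))  # no zero element
```

The same function serves for both operations, which is the point of the single definition: only the operation handed to it changes.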

Points on a Line 

It is possible to represent the whole numbers as points 
on a line. On any straight line, choose a point and call it 0. 
This point divides the line into two half-lines. On one of 
these half-lines, choose another point and call it 1. Now 
continue locating points further and further away from 0 
by making the distance from each point to the next one 
the same length as the distance from 0 to 1. Label these
new points successively 2, 3, 4, 5, etc. We then have an
endless sequence of points that is in one-to-one correspondence
with the set of whole numbers. The number attached
to each point is its distance from 0, expressed in terms of
the distance from 0 to 1 as the unit of length.

[Figure: a half-line with equally spaced points labeled 0, 1, 2, 3, 4, 5]

We can define addition and multiplication for these
points by means of geometric constructions. Here, for example,
is one way of doing it: To add a and b, measure out
from a, in the direction away from 0, a length equal to
the distance from 0 to b. The point located in this way
has a distance from 0 equal to a + b. To multiply a and b,
first draw another line intersecting this one at 0. Use the
same scheme for assigning numbers to points on this line.
Locate points 1', 2', 3', etc., on the line, so that successive
points are separated by equal distances, all equal to the
distance from 0 to 1'. Join 1' on the new line to a on the
original line. Locate b' on the new line at a distance from
0 equal to b. Then through b' draw a line parallel to the
line just drawn from 1' to a. It will cross the original line
at a point that will represent a • b.

[Figure: length a followed by length b, laid end to end from 0, gives length a + b]

The construction for addition obviously corresponds to
ordinary addition of whole numbers. The construction for
multiplication corresponds to ordinary multiplication for
this reason: If we designate by x the point that we have defined
as the product of a and b, then x is its distance from 0.
The triangles (0 1' a) and (0 b' x) are similar, so their corresponding
sides are proportional. Then 1 : b = a : x. From
this proportion, we find that x = a • b. With addition and
multiplication defined by these constructions, the system
of points on the half-line is isomorphic to the system of
whole numbers.
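
The multiplication construction can be imitated with coordinates. In the Python sketch below, which is an added illustration, we assume that the original line is the horizontal axis and that the second line leaves 0 at an angle of our own choosing; the name multiply_by_construction and the default angle of 60 degrees are assumptions, not part of the text. The function draws the parallel through b' and reports where it crosses the original line.

```python
import math

def multiply_by_construction(a, b, angle_degrees=60.0):
    """Locate a * b by the parallel-line construction.

    The angle must not be 0 or 180 degrees, or the two lines coincide.
    """
    theta = math.radians(angle_degrees)
    ux, uy = math.cos(theta), math.sin(theta)  # unit step along the new line
    one_prime = (ux, uy)                       # the point 1'
    b_prime = (b * ux, b * uy)                 # the point b'
    # Direction of the line joining 1' to the point a = (a, 0).
    dx, dy = a - one_prime[0], 0.0 - one_prime[1]
    # Points on the parallel through b' are b' + t * (dx, dy).
    # Choose t so that the height above the original line is zero.
    t = -b_prime[1] / dy
    return b_prime[0] + t * dx                 # distance from 0 of the crossing

print(multiply_by_construction(3, 4))
print(multiply_by_construction(2, 5, angle_degrees=30.0))
```

Up to the small rounding errors of floating-point arithmetic, the crossing point lies at distance a • b from 0 whatever angle is chosen, in agreement with the argument from similar triangles.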

After we have assigned numbers in this way to points on 
a line, we find that there are still many points on the line 
that do not have numbers. All of our numbers except zero 
are on one side of 0. There are none on the other side. Moreover,
we have not assigned numbers to the points between
those that represent successive whole numbers. For example,
there are no numbers assigned to the points that lie between 
0 and 1. This is a defect that will be remedied step by step 
as we go along. One of our chief purposes will be to build up
a number system that has enough numbers in it for us to assign
a number to every point on the line. We hope to do it, too,
in such a way that the expanded number system and the 
whole line will be isomorphic.

The Natural Numbers

Let us return for a moment to the system of natural num- 
bers, the original system of counting numbers that does not 
include zero as a number. On page 25 we encountered two 
different systems that are capable of representing the natu- 
ral numbers. One system consists of the common Arabic 
numerals, with their addition and multiplication tables. A 
second one consists of the Roman numerals, with their 
tables. There is a third system in which each counting num- 
ber is pictured as a point on a line, and addition and multi- 
plication are defined by the constructions described above. 
This variety of representations raises the question, “What is 
the natural number system, anyhow?” 

We might try to answer this question by listing some 
characteristics that all these number systems have in com- 
mon. All three, for example, obey the five laws. But this is 
not an adequate answer, because all number systems, as we 
have defined the term, will obey these five laws. And we 
intend to produce some number systems that are not inter- 
changeable with the natural number system at all. To define 
the natural number system, we must list not merely charac- 
teristics that all of its representations have in common. We 
must note particularly its distinguishing characteristics. 
This is done by choosing the defining characteristics in such 
a way that all systems that have these characteristics must 
be isomorphic to each other. Such a selection of character- 
istics that effectively defines one and only one structure is 
called a system of axioms for the structure. Here is a system 
of axioms for the natural number system, first formulated 
by the mathematician Peano: 

A set of elements is called a natural number system if it 
has the following characteristics: 

(1) It contains an element called 1. 

(2) For every member in the system, there is another 
member (and only one) called its successor. 

(3) Two distinct members do not have the same successor.

(4) There is no member of the system that has 1 as its
successor.

(5) If a set of elements belonging to the system contains 
1, and, for each member that it contains, also con- 
tains its successor, then this set contains the whole 
system. 

Notice that addition and multiplication are not men- 
tioned in these axioms at all. Peano defined these operations 
in terms of his axioms as follows: For any natural numbers
x and y,

let x + 1 = the successor of x;
let x + (the successor of y) = the successor of (x + y);
let x • 1 = x;
let x • (the successor of y) = x • y + x.

With these definitions it is possible to prove that the 
natural number system obeys the five laws. 
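
Peano's four defining equations translate almost word for word into a recursive program. The Python sketch below is an added illustration, not Peano's own formulation; the class Nat, the constant ONE, and the helpers succ, from_int, and to_int are our own assumptions. A number is represented only by the number it is the successor of, so that addition and multiplication use nothing but the successor idea.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Nat:
    # The number this one is the successor of; None marks the element 1.
    pred: Optional["Nat"] = None

ONE = Nat()

def succ(n):
    return Nat(n)

def add(x, y):
    if y.pred is None:              # x + 1 = the successor of x
        return succ(x)
    return succ(add(x, y.pred))     # x + succ(y') = succ(x + y')

def mul(x, y):
    if y.pred is None:              # x • 1 = x
        return x
    return add(mul(x, y.pred), x)   # x • succ(y') = x • y' + x

def from_int(k):
    """Build the natural number k (k must be at least 1)."""
    n = ONE
    for _ in range(k - 1):
        n = succ(n)
    return n

def to_int(n):
    """Count how many successor steps separate n from 1."""
    count = 1
    while n.pred is not None:
        n = n.pred
        count += 1
    return count

two, three = from_int(2), from_int(3)
print(to_int(add(two, three)), to_int(mul(two, three)))
print(add(two, three) == add(three, two))  # one instance of law I
```

Running a few cases like these illustrates the laws but does not prove them; the proofs rest on axiom (5).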

What Peano did for the natural number system is typical 
of the way in which mathematical structures are studied 
today. In modern mathematics, a mathematical structure is 
often defined as a set of objects that satisfies a specified 
set of axioms. If the structure defined is to be unique, the 
axioms are chosen so that all systems that satisfy the 
axioms will be isomorphic to each other. Different sets 
of axioms have been formulated for the various mathemati- 
cal structures needed in practical applications. 

DO IT YOURSELF 

1. By using double-headed arrows, as on page 26, set up 
a one-to-one correspondence between the numbers 1, 2, 
3, 4, 5 and the letters a, e, i, o, u. 

2. An addition operation for the system consisting of two 
elements, a and b, is defined by the following table: 


 +  |  a   b
----+--------
 a  |  a   b
 b  |  b   a

a) Does this system have a zero element?

b) Show that addition is commutative in this system. 

c) Verify from the table that a + (a + b) = (a + a) 
+ b. 

3. Let the symbol M stand for the binary operation, “take 
the maximum of.” For example, 5 M 7 means 7; 8 M 3
means 8; 6 M 6 means 6. Compare 8 M 3 with 3 M 8. 
If a and b are any two natural numbers, compare a M b 
with b M a. Is the operation M commutative? Compare 
8 M (3 M 7) with (8 M 3) M 7. In general, if a, b, and c 
are any three natural numbers, compare a M (b M c) 
with (a M b) M c. Is the operation M associative?

CHAPTER II


Number Systems without
“Numbers”


THE word “number,” as we use it in everyday life, 
refers to a symbol associated with counting or measuring. 
We have broken away from this usage in the definition of a 
number system given in Chapter I. We defined a number 
system as any set of objects on which two binary operations 
are defined that obey the five laws listed on pages 24-5. In 
this definition, there is no reference to counting or measur- 
ing. The five laws are concerned only with the way in which 
the numbers are related to each other by the addition and 
multiplication tables. To emphasize this fact, we separated 
the concept of natural number from that of cardinal num- 
ber. While cardinal numbers are properties of actual sets 
of objects, and are intrinsically related to counting, natural 
numbers are abstract symbols whose entire meaning lies in 
the formal rules by which we manipulate them. 

Nevertheless, the natural numbers do not convey the full 
meaning of our break with common usage in the definition 
of number. Introducing the natural numbers effected a 
separation from the cardinal numbers, but not a divorce. 
The cardinal number system is still lurking in the back- 
ground, because it is isomorphic to the natural number 
system. This fact may arouse the suspicion that no signifi- 
cant change in the concept of number has really been intro- 
duced, and that numbers are still essentially bound up 
with counting and measuring. However, a real change has 
been introduced by our definition of number system. The
purpose of this chapter is to demonstrate this fact convincingly
by producing some number systems without “numbers.”
These number systems will consist of elements that
have no direct relationship to counting or measuring, and 
so are not “numbers” in the sense in which the word is 
commonly used. However, they will be genuine number 
systems in the sense of our definition. 

Subsets of a Set 

We shall construct these number systems with the help 
of the simple notion of a set. A set is any collection of 
objects. The objects that belong to a set are called its 
elements. A set is defined by specifying which objects are 
elements of the set. This may be done by stating some rule 
by which the elements can be identified, or by actually 
putting the elements on display. The symbol commonly 
used for a set is a pair of braces, with the elements of 
the set on display inside, or with the rule by which they are 
identified printed inside. For example, here is a set defined 
by a rule: 

{natural numbers larger than 4, but less than 10} 

The same set can be represented by putting its elements on 
display: 

{5, 6, 7, 8, 9}

Other sets can be formed from a given set by removing 
some of its elements. For example, if we remove the ele- 
ments 5, 7, and 8 from the set shown above, we are left with 
the set {6, 9}. It is convenient to extend the notion of set to 
include what is left if we remove all the elements. It 
will be a “set” with no elements in it, and will be referred 
to as the empty set. To symbolize it, we shall show a pair 
of braces with no elements on display inside. A set obtained 
by removing none, or some, or all of the elements of a given 
set is called a subset of that set. For example, the set 
{x, y, z} has eight subsets, listed below:

{x, y, z}   {x, y}   {x, z}   {y, z}
{x}         {y}      {z}      { }

Notice that the given set is one of its own subsets, and the 
empty set is one of the subsets, too. 
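
The subsets of a small set can be listed by machine. The Python sketch below is an added illustration; the function name subsets is our own assumption. It forms every selection of 3, 2, 1, and 0 elements, which amounts to removing none, some, or all of the elements of the given set.

```python
from itertools import combinations

def subsets(elements):
    """List every subset, from the whole set down to the empty set."""
    elements = list(elements)
    result = []
    for size in range(len(elements), -1, -1):
        for chosen in combinations(elements, size):
            result.append(set(chosen))
    return result

found = subsets(["x", "y", "z"])
for s in found:
    print(sorted(s))  # the empty set prints as []
print(len(found))
```

The list begins with the given set itself and ends with the empty set, and it has eight entries in all.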

Operations on Subsets 

To define a number system, we must first specify what 
the elements of the number system are. We shall use as 
elements all the subsets of a given set. As a specific example, 
let us build a number system out of the subsets of the set 
{x, y, z}. For convenience in talking about them, let us 
assign a name to each of these subsets. We shall use capital 
letters for their names, as follows: 


I = {x, y, z}     D = {x}
A = {x, y}        E = {y}
B = {x, z}        F = {z}
C = {y, z}        0 = { }

The symbols I and 0 are included among the names used
for reasons that will become clear later.

The next step is to define two binary operations on these 
elements. A binary operation is defined when we set up 
some rule for assigning to each ordered pair of subsets some 
particular subset in the same list. We define the operation 
of forming a union of two subsets by means of this rule: 
The union of two subsets is another subset formed by taking 
as its elements those elements that are in one or the other of 
the subsets being united. For example, A contains the 
elements x and y. B contains the elements x and z. The 
elements that are in one or the other are x, y, and z. So the 
union of A and B is the set {x, y, z}, which we have called I.
The union operation will be the “addition” operation of
this number system. However, we shall not use the plus
sign to represent it. Instead, we shall use the symbol ∪.
The union of A and B will be written as A ∪ B, and is read
as “A union B.” We have seen that A ∪ B = I. The method
of finding the union of two subsets will be clear from the
following examples:

I ∪ C = {x, y, z} ∪ {y, z} = {x, y, z} = I

D ∪ E = {x} ∪ {y} = {x, y} = A

C ∪ 0 = {y, z} ∪ { } = {y, z} = C

The results of forming all possible unions can be sum- 
marized in this table of unions (the addition table for the 
number system we are constructing) : 


 ∪  |  I   A   B   C   D   E   F   0
----+--------------------------------
 I  |  I   I   I   I   I   I   I   I
 A  |  I   A   I   I   A   A   I   A
 B  |  I   I   B   I   B   I   B   B
 C  |  I   I   I   C   I   C   C   C
 D  |  I   A   B   I   D   A   B   D
 E  |  I   A   I   C   A   E   C   E
 F  |  I   I   B   C   B   C   F   F
 0  |  I   A   B   C   D   E   F   0
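
A table of this kind can be generated from the rule for union rather than filled in by hand. The Python sketch below is an added illustration; the dictionary names, the helper table, and the use of Python's frozenset type are our own assumptions. Each named subset is stored once, and every pair is combined and the result looked up by name.

```python
# The eight named subsets of {x, y, z}, as assigned in the text.
names = {
    "I": frozenset("xyz"), "A": frozenset("xy"), "B": frozenset("xz"),
    "C": frozenset("yz"),  "D": frozenset("x"),  "E": frozenset("y"),
    "F": frozenset("z"),   "0": frozenset(),
}
name_of = {subset: name for name, subset in names.items()}

def table(op):
    """Map each ordered pair of names to the name of op applied to the pair."""
    return {(p, q): name_of[op(names[p], names[q])]
            for p in names for q in names}

union = table(lambda s, t: s | t)  # | is Python's union of sets

print("∪ | " + " ".join(names))
for p in names:
    print(p + " | " + " ".join(union[(p, q)] for q in names))
```

Because table accepts any operation on sets, the same helper could tabulate other operations on these subsets as well.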


The second binary operation we define is that of form- 
ing the intersection of two subsets. The intersection of 
two subsets is another subset formed by taking as its ele- 
ments all those elements that are in both of the subsets being 
intersected. For example, A contains the elements x and y. 
B contains the elements x and z. Only the element x is in 
both A and B. So the intersection of A and B is the subset 
{x}, which we have called D. The intersection operation
will be the “multiplication” operation of the number system
we are constructing. We shall designate it by the symbol ∩.
The intersection of A and B will be written as A ∩ B, and
is read as “A intersection B.” Then we see that A ∩ B = D.
When two subsets have no elements in common, their intersection
is the empty set. The method of finding intersections
is shown in the following examples:

I ∩ C = {x, y, z} ∩ {y, z} = {y, z} = C

A ∩ D = {x, y} ∩ {x} = {x} = D

B ∩ 0 = {x, z} ∩ { } = { } = 0

E ∩ F = {y} ∩ {z} = { } = 0

The results of forming all possible intersections can be sum- 
marized in this table of intersections (the multiplication 
table for the number system we are constructing) : 


 ∩  |  I   A   B   C   D   E   F   0
----+--------------------------------
 I  |  I   A   B   C   D   E   F   0
 A  |  A   A   D   E   D   E   0   0
 B  |  B   D   B   F   D   0   F   0
 C  |  C   E   F   C   0   E   F   0
 D  |  D   D   D   0   D   0   0   0
 E  |  E   E   0   E   0   E   0   0
 F  |  F   0   F   F   0   0   F   0
 0  |  0   0   0   0   0   0   0   0


The Five Laws Are Obeyed 

The operations “union” and “intersection” obey the 
five laws listed on pages 24-5. We can verify this fact by 
referring back to the meaning of these operations. Let us examine
the laws one at a time, to see if they are obeyed.

1. The commutative law of addition. Union is our addition
operation, so we must see whether X ∪ Y = Y ∪ X, where
X and Y represent any subsets of I. X ∪ Y means the set
that consists of those elements that are in X or Y. Y ∪ X
means the set that consists of those elements that are in
Y or X. These are obviously the same sets, so law number 1
is obeyed.

2. The associative law for unions. We must see whether
(X ∪ Y) ∪ Z = X ∪ (Y ∪ Z). The set (X ∪ Y) ∪ Z is
the set consisting of elements that are in X or Y, or in Z.
The set X ∪ (Y ∪ Z) is the set consisting of elements that
are in X, or in Y or Z. These are obviously the same sets,
so that law number 2 is obeyed.

3. The commutative law of multiplication. Intersection is
our multiplication operation, so we must see whether
X ∩ Y = Y ∩ X. X ∩ Y means the set consisting of elements
that are in both X and Y. Y ∩ X means the set consisting
of elements that are in both Y and X. These are
obviously the same sets, so law number 3 is obeyed.

4. The associative law for intersections. (X ∩ Y) ∩ Z
means the set consisting of elements that are in X and Y,
and also in Z. X ∩ (Y ∩ Z) means the set consisting of
elements that are in X, and also in Y and Z. Clearly, then
(X ∩ Y) ∩ Z = X ∩ (Y ∩ Z), and law number 4 is
obeyed.

5. The distributive law. We must see if X ∩ (Y ∪ Z) =
(X ∩ Y) ∪ (X ∩ Z). X ∩ (Y ∪ Z) means the set consisting
of elements that are in X and in Y or Z. (X ∩ Y) ∪
(X ∩ Z) means the set consisting of elements that are in
X and Y, or in X and Z. These are clearly the same sets,
so law number 5 is obeyed.
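
Since the system has only eight members, the five laws can also be confirmed by trying every case. The Python sketch below is an added illustration; the names power_set and five_laws_hold are our own assumptions. It runs through all 8 • 8 • 8 = 512 ordered triples of subsets, with | standing for union and & for intersection.

```python
from itertools import combinations, product

def power_set(elements):
    """All subsets of the given elements, as frozensets."""
    elements = list(elements)
    return [frozenset(chosen)
            for size in range(len(elements) + 1)
            for chosen in combinations(elements, size)]

def five_laws_hold(members):
    """Check laws 1 to 5 with union as addition, intersection as multiplication."""
    for x, y, z in product(members, repeat=3):
        if (x | y) != (y | x):                    # law 1
            return False
        if ((x | y) | z) != (x | (y | z)):        # law 2
            return False
        if (x & y) != (y & x):                    # law 3
            return False
        if ((x & y) & z) != (x & (y & z)):        # law 4
            return False
        if (x & (y | z)) != ((x & y) | (x & z)):  # law 5
            return False
    return True

print(five_laws_hold(power_set("xyz")))
print(five_laws_hold(power_set("abcd")))  # a sixteen-member system
```

An exhaustive check of this kind settles the matter for one particular set; the argument in the text covers the subsets of any set at once.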

Since the five laws are obeyed, the system of subsets of 
I, with the operations union and intersection, forms a num- 
ber system. Similar number systems can be constructed 
from the subsets of any given set. In the example just given, 
we started with a set that contains three elements, and 
found that it has eight subsets. As a result we obtained a 
number system that contained exactly eight members. Had
we started with a different number of elements, we would
have obtained a number system with a different number
of members. For example, a set with two elements has four
subsets. A set with four elements has sixteen subsets. A
set with five elements has thirty-two subsets. In general,
a set with n elements has 2ⁿ subsets.

Zero and Unity Elements 

The number system we have constructed has a zero ele- 
ment and a unity element. Since union is our addition oper- 
ation, a zero element would have to have the property that 
when it is united with any element of the system, it leaves 
that element unchanged. A glance at the union table on 
page 38 shows that the empty set has this property. That 
is why we used the symbol 0 to represent it. Since intersec- 
tion is our multiplication operation, a unity element would 
have to have the property that when it is intersected with 
any element of the system, it leaves that element un- 
changed. A glance at the intersection table on page 39 
shows that the original set I has this property. We chose the 
symbol I to represent it because of its resemblance to the 
number 1. 

Special Properties 

The number system we have just constructed out of the
subsets of {x, y, z} has, as we have seen, some properties
that it shares with the whole number system. These include
obedience to the five laws, and possession of a zero
element and a unity element. However, it also has some
peculiar properties that are entirely unlike the properties 
of the natural number system. A few of these are noted 
here. 

1. We can see from the tables that for any element X in
the system, X ∪ X = X, and X ∩ X = X. That is, a subset
united with itself yields the same subset, and a subset
intersected with itself yields the same subset. In the whole
number system, such an outcome is the exception rather than
the rule. 0 + 0 = 0, but 2 + 2 is not 2. 1 • 1 = 1, but 2 • 2
is not 2.

2. We have already observed that intersection is distribu- 
tive with respect to union. It can also be verified that union 
is distributive with respect to intersection. That is, in the 
statement of the distributive law, union and intersection
can change places. This, too, is unlike what we found in the 
natural number system. There, while multiplication is 
distributive with respect to addition, addition is not dis- 
tributive with respect to multiplication. 

3. For each subset in the system, we can find another 
one that contains just those elements that the first one does 
not. We call this second subset the complement of the first 
one, because, while they do not overlap (their intersection 
is 0), together they complete the original set (their union 
is I). If X is any subset in the system, we denote its complement
by X'. In the system of subsets of {x, y, z}, A =
{x, y}, so A' = {z} = F. Similarly, B' = E, C' = D, and
I' = 0. The operation of “taking the complement” has the
following properties:

X ∩ X' = 0; X ∪ X' = I; (X')' = X.

It also obeys the very useful law known as De Morgan’s
Rule: The complement of a union is the intersection of the
complements; and the complement of an intersection is the
union of the complements. Written out in symbols, the law
says:

(X ∪ Y)' = X' ∩ Y'; (X ∩ Y)' = X' ∪ Y'.

The truth of the law can be observed by noting that a union
consists of elements in one set or the other, an intersection
consists of elements in one set and another, and a complement
consists of the elements not in a particular set. Then
De Morgan’s rule says that “not in either X or Y” is the
same as “not in X and not in Y”; and that “not in both X
and Y” is the same as “not in X or not in Y.” A little thought
will show that these statements are correct.
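
The three special properties listed above can each be confirmed by trying every case in the eight-member system. The Python sketch below is an added illustration; the names universe, power_set, and complement are our own assumptions. The last line shows, for contrast, that addition of ordinary numbers is not distributive with respect to multiplication.

```python
from itertools import combinations, product

universe = frozenset("xyz")  # the set called I in the text
empty = frozenset()          # the set called 0 in the text

def power_set(elements):
    elements = list(elements)
    return [frozenset(chosen)
            for size in range(len(elements) + 1)
            for chosen in combinations(elements, size)]

def complement(x):
    """The elements of the universe that are not in x."""
    return universe - x

members = power_set(universe)

# Property 1: a subset united or intersected with itself is unchanged.
idempotent = all(x | x == x and x & x == x for x in members)

# Property 2: union is distributive with respect to intersection.
dual_distributive = all(x | (y & z) == (x | y) & (x | z)
                        for x, y, z in product(members, repeat=3))

# Property 3: the complement laws and De Morgan's rule.
complement_laws = all(x & complement(x) == empty
                      and x | complement(x) == universe
                      and complement(complement(x)) == x
                      for x in members)
de_morgan = all(complement(x | y) == complement(x) & complement(y)
                and complement(x & y) == complement(x) | complement(y)
                for x, y in product(members, repeat=2))

print(idempotent, dual_distributive, complement_laws, de_morgan)
print(2 + (3 * 4) == (2 + 3) * (2 + 4))  # 14 compared with 30
```

All four checks on the subsets succeed, while the comparison of ordinary numbers on the last line fails.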




The Algebra of Logic 

The number system we have constructed in this chapter 
is only one of a whole family of number systems that have 
similar properties. They are called Boolean algebras. The 
type of structure that they represent is not just a mathe- 
matical curiosity. It has an important practical application 
in the study of logic, and in the design of electronic com- 
puters. In logic we study relationships among statements. 
The analysis of these relationships can be carried out in 
symbols in the following way : Let each proposition or state- 
ment be represented by a letter, such as p, q, or r. Use the 
symbol ∪ for “or,” the symbol ∩ for “and,” and the symbol
' for “not,” as we already have done in the last paragraph. 
Use 0 for any statement that is false, and I for any statement 
that is true. With this notation, the class of statements and 
their logical relations becomes a Boolean algebra. Boolean 
algebras are named after the English mathematician, George 
Boole, who pioneered in the study of symbolic logic. 
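
The logical reading of the symbols can be tried out with the two truth values alone. In the Python sketch below, which is an added illustration, we assume that the language's own False and True stand for the text's 0 and I, and that its words or, and, and not stand for ∪, ∩, and '. The variable names are our own.

```python
from itertools import product

truth_values = [False, True]  # playing the parts of 0 and I

# De Morgan's rule, read as a statement about "or", "and", and "not".
de_morgan = all(
    (not (p or q)) == ((not p) and (not q))
    and (not (p and q)) == ((not p) or (not q))
    for p, q in product(truth_values, repeat=2)
)

# The distributive law in both directions.
distributive = all(
    (p and (q or r)) == ((p and q) or (p and r))
    and (p or (q and r)) == ((p or q) and (p or r))
    for p, q, r in product(truth_values, repeat=3)
)

# False behaves as a zero element for "or", True as a unity element for "and".
identities = all((False or p) == p and (True and p) == p
                 for p in truth_values)

print(de_morgan, distributive, identities)
```

With only two values there are very few cases to try, so every law can be checked completely.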

DO IT YOURSELF 

1. Assign names to the subsets of the set {x,y} as follows: 

I = {x, y), A = {x},B = {y}, 0 = { } = theempty 
set. 

a) Construct a table for the union operation for this 
system of subsets. 

b) Construct a table for the intersection operation. 

2. Let I represent the set {a, b, c, d, e, f, g, h}.

Let X represent the subset {a, b, c, d}.

Let Y represent the subset {a, b, e, f, g}.

a) What elements are in X', the complement of X in I?

b) What elements are in Y', the complement of Y in I?

c) What elements are in X' ∪ Y'?

d) What elements are in X ∩ Y?

e) What elements are in (X ∩ Y)'?

f) Compare your answers to c) and e) to show that
X' ∪ Y' = (X ∩ Y)'.

3. List all the sixteen subsets of the set {a, b, c, d}.
CHAPTER III


New Numbers from Old 

More Expansions 

IN Chapter I we began with the system of natural 
numbers, or counting numbers, and expanded it to form the 
system of whole numbers by adjoining the new number, 
zero, to the counting numbers. This was the first of a series 
of expansions of the number system that we shall carry out. 
The second one is the subject of this chapter. 

In our everyday use of whole numbers, besides adding 
them and multiplying them, we sometimes have occasion to 
subtract them. The operation of subtraction can be defined 
in terms of addition. The symbol 5 — 3 really asks us the 
question, “What whole number added to 3 gives 5?” Since 
the answer to the question is the number 2, we say 5 — 3 = 
2. We call the answer the difference between 5 and 3. The 
question can also be written in the form of an equation, 
x + 3 = 5, and the answer to the question is the solution to 
the equation. 

Our success in finding the difference between 5 and 3 
tempts us to try to find the difference between any two 
whole numbers chosen at random. But then we run into 
trouble. Suppose, for example, we try to find the difference 
between 3 and 5, written as 3 — 5. First we have to interpret 
the symbol as a question. It asks us, “What whole number 
added to 5 gives 3?” Unfortunately, the answer is that there 
isn’t any such number. In the system of whole numbers, we
cannot subtract just any number from any other number. The
only time subtraction is possible in that system is when the 
subtrahend is not larger than the minuend. If a and b stand 
for any whole numbers, then the expression a — b doesn’t al- 
ways have a meaning. If we think of it as the question, 
“What whole number added to b gives a?”, then it doesn’t 
always have an answer. If we use the equation x + b = a,
which asks the same question, then it does not always have a 
solution. This is a defect of the system of whole numbers 
that limits its usefulness. Because, although the question 
3 — 5 is meaningless for the system of whole numbers, there 
are practical problems that lead to just such a question. For 
example, if the temperature is 3 degrees, what will it be after 
the mercury drops 5 degrees? It would be useful to have a 
number system which contains a number that can serve as 
the answer to this question. The defect of the system of 
whole numbers that we have observed confronts us with a 
challenge. Can we construct a number system that does not 
have this defect? Can we build a number system in which 
subtraction is always possible for any pair of numbers taken 
in any order, so that a — b always has a meaning, and 
x + b = a always has a solution? We find that we can. 

Readers who have had high school algebra will remember 
that a system of numbers that includes “negative” as well 
as “positive” numbers is supposed to serve this purpose. 
But in their course in high school algebra, they were given 
this system as a finished product obeying certain mysterious 
rules such as, “the product of two negative numbers is a 
positive number.” In what follows, we do not take the exist- 
ence of such a number system for granted. We prove it exists 
by actually constructing it. We also remove the mystery 
surrounding its rules by deriving them from the familiar 
rules governing the system of whole numbers. 


Families of Differences 

To construct the improved number system, we use a 
rather interesting device. The symbol a — b asks us a ques- 
tion which does not always have an answer. To make sure 
that it will have an answer in the new system, we let the 
question be its own answer! In effect we say, let each 
expression like 5 — 3, or 3 — 5, or 2 — 7, represent a num- 
ber in the new system. To justify calling these strange 
things numbers we shall have to define addition and multiplication
operations for them, and then show that with
these operations they really constitute a number system. 

However, we run into some complications even before we 
take our first step in this direction. In the system of whole 
numbers, 5 — 3 does have an answer, and the answer is 2. 
But 2 — 0, 3 — 1, 4 — 2, 6 — 4, and an endless list of similar
symbols also represent 2. So we cannot simply let each such 
symbol stand for a separate number in the new system. We 
would want all of these symbols to represent the same num- 
ber, just as they do in the system of whole numbers. We take 
care of this difficulty by using as the elements of our new 
number system, not single symbols written in the form 
of a “difference” between two whole numbers, but whole 
families of such differences. The first step is to establish 
a rule by which we can recognize when two such symbols 
belong to the same family. We get a clue to the rule we 
should use by examining the difference symbols that represent
the number 2. The difference 3 — 1 and the difference
6 — 4 represent the same number. Notice that if we add 
the left number of each symbol to the right number of 
the other, we get the same sum: 3 + 4 = 6 + 1. We shall 
use this relationship as the criterion for identifying differ- 
ences that belong to the same family. 

Now we are ready to carry out our construction step by 
step. First we take all possible ordered pairs of whole 
numbers, such as 7 and 5, 3 and 9, 15 and 1, and so on. Then 
we write the “difference” of the numbers in the pair, taken 
in a definite order. Since the difference does not always 
have a meaning in the system of whole numbers, we shall 
not use an ordinary minus sign when we write it. We shall 
use the symbol ~ instead, to remind us that this is really not 
subtraction of whole numbers, but merely a symbol sug- 
gested by subtraction. So we now have symbols like 7 ~ 5, 
3 ~ 9, 15 ~ 1, and so on, each of which will be called a 
“difference” between whole numbers. 

Now we associate with each difference a whole family 
of differences in the following way: The family belonging 
to a difference a ~ b consists of all those differences u ~ v
for which a + v = u + b. To designate the family that be- 
longs to a difference we shall write that difference in par- 
entheses. Thus (a ~ b) means the family of differences that 
belongs to a ~ b. The symbol (3 ~ 1) means the family of 
differences that belongs to 3 ~ 1. We have already seen that 
the difference 6 ~ 4 belongs to this family because 3 + 4 
= 6 + 1. We call these families of differences integers. They 
will be the elements of our new number system. 

We observe immediately two characteristics of these 
families which we call integers:

1) A difference belongs to its own family. For example,
3 ~ 1 belongs to (3 ~ 1). This follows from the fact that
u ~ v belongs to (a ~ b) if a + v = u + b. In this case,
a = 3, b = 1, u = 3, and v = 1, and 3 + 1 = 3 + 1. In general,
a ~ b belongs to (a ~ b) because a + b = a + b.

2) If one of two differences belongs to the family of the 
other, then they have the same families. Suppose, for example,
that a ~ b belongs to the family (c ~ d). Then
we can show that every member of (a ~ b) belongs to
(c ~ d), and vice versa. If p ~ q belongs to (a ~ b), then
by the criterion for membership in a family, a + q = p + b.
However, a ~ b belongs to (c ~ d), so c + b = a + d.
Adding these two equalities, we get a + b + c + q = a + b
+ p + d. Taking away a + b from both sides, we get c + q
= p + d. But this is equivalent to saying that p ~ q be- 
longs to (c ~ d), according to our criterion for membership 
in a family. This shows that any member of (a ~ b) also 
belongs to (c ~ d). A similar argument, going through the 
same chain of steps in reverse, shows that every member of 
(c ~ d) also belongs to (a ~ b). So (a ~ b) and (c ~ d) 
have the same memberships, and are therefore the same. 

The second characteristic of these families called integers 
has these consequences: First, each difference a ~ b belongs 
to one and only one integer. Secondly, an integer may be 
represented by putting on display inside parentheses any 
one of the differences that belong to it. So (3 ~ 1), (4 ~ 2), 
(5 ~ 3) all represent the same integer. Thirdly, the criterion
for membership in an integer can also serve as a test
for equality of integers. That is, the integers (a ~ b) and
(c ~ d) are equal if and only if a + d = c + b. For example,
to prove that (3 ~ 1) = (4 ~ 2), it is enough to observe
that 3 + 2 = 4 + 1.
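The membership criterion is simple enough to try out on a machine. In the sketch below a difference a ~ b is stored as the pair (a, b). The names same_integer and family are our own, and since a family is endless, family lists only the members whose numbers stay below a chosen limit.

```python
def same_integer(d1, d2):
    """Differences a ~ b and c ~ d belong to the same family
    exactly when a + d == c + b."""
    (a, b), (c, d) = d1, d2
    return a + d == c + b

def family(d, limit):
    """Members u ~ v of the family of d with u and v below limit."""
    return [(u, v)
            for u in range(limit)
            for v in range(limit)
            if same_integer(d, (u, v))]

print(same_integer((3, 1), (6, 4)))  # True, because 3 + 4 = 6 + 1
print(family((3, 1), 7))
# [(2, 0), (3, 1), (4, 2), (5, 3), (6, 4)]
```

Note that the test uses only addition of whole numbers, so it never has to carry out a subtraction that might be impossible.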


Addition and Multiplication of Integers 

Now we define addition and multiplication for the system 
of integers. We assign a sum to any ordered pair of integers 
by means of the following defining equation : 

(a ~ b) + (c ~ d) = (a + c ~ b + d) 

The symbol on the right hand side of this equation repre- 
sents an integer, because, if a and c are whole numbers, 
a + c is also a whole number. Similarly, b + d is a whole 
number. Then a + c ~ b + d is a difference of whole numbers,
and there is an integer that belongs to it.

We assign a product to any ordered pair of integers by 
means of the following defining equation : 

(a ~ b) · (c ~ d) = (a · c + b · d ~ a · d + b · c)

Here, too, the right hand side represents an integer, because 
the symbol inside the parentheses represents a difference 
of two whole numbers. 

Notice that to find the sum of two integers, we make use 
of the whole numbers whose differences are on display 
inside the parentheses representing these integers. This fact 
leads to a problem that we have to pay attention to. Each 
integer is a family of differences. Any member of a family 
might be put on display to represent it. If we pick another 
difference to represent each of the integers we are adding, 
will we still get the same sum? If we do not, our definition 
is useless. However, the definition was well chosen. If we 
follow the directions it gives for finding the sum of two 
integers, we arrive at the same result no matter which mem- 
bers of the integers are used to represent them. We shall 
not take the time to prove this fact here, but shall merely 
verify it for a particular case. Suppose we want to add 
(5 ~ 3) and (6 ~ 5). Using our definition, we find that
(5 ~ 3) + (6 ~ 5) = (5 + 6 ~ 3 + 5) = (11 ~ 8). 

However, (5 ~ 3) could also be represented by (4 ~ 2), 
because 5 + 2 = 4 + 3. Similarly, (6 ~ 5) could also be 
represented by (5 ~ 4), because 6 + 4 = 5 + 5. If we apply 
our definition to these other representatives of the two 
integers, we find that 

(4 ~ 2) + (5 ~ 4) = (4 + 5 ~ 2 + 4) = (9 ~ 6). 

By using different representatives for the two integers we 
were adding, we got sums that look different. However, al- 
though they look different, the sums are the same. 

(11 ~ 8) = (9 ~ 6), because 11 + 6 = 9 + 8. 

The same problem arises in connection with our defini- 
tion for multiplication of integers. The definition makes use 
of a particular difference that belongs to each integer. But 
it can be shown that it does not matter which difference 
that belongs to an integer is chosen as its representative. 
They all lead to the same product anyhow. So there is no 
ambiguity in our definitions of addition and multiplication. 
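The claim that the choice of representative does not matter can be tested by brute force over a small range of whole numbers. This is a sketch, not a proof: it tries only differences whose numbers are below a limit, and the function names add, mul, same_integer, and well_defined are our own.

```python
from itertools import product

def same_integer(d1, d2):
    (a, b), (c, d) = d1, d2
    return a + d == c + b

def add(d1, d2):
    """(a ~ b) + (c ~ d) = (a + c ~ b + d)"""
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def mul(d1, d2):
    """(a ~ b) · (c ~ d) = (a·c + b·d ~ a·d + b·c)"""
    (a, b), (c, d) = d1, d2
    return (a * c + b * d, a * d + b * c)

def well_defined(op, limit=4):
    """Try every pair of representatives with numbers below limit."""
    pairs = list(product(range(limit), repeat=2))
    for p, p2, q, q2 in product(pairs, repeat=4):
        if same_integer(p, p2) and same_integer(q, q2):
            if not same_integer(op(p, q), op(p2, q2)):
                return False
    return True

print(add((5, 3), (6, 5)), add((4, 2), (5, 4)))  # (11, 8) (9, 6)
print(well_defined(add), well_defined(mul))      # True True
```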

The Integers Form a Number System 

We now have a system of elements called integers, with 
an addition operation and a multiplication operation de- 
fined for this system. To show that the integers form a 
number system, we have to prove that the operations obey 
the five laws listed on pages 24-5. As an example of how such 
a proof is carried out, we give the details of the proof for 
the commutative law of addition. Let (a ~ b) be any in- 
teger, and (c ~ d) any other integer. We must show that 
(a ~ b) + (c ~ d) = (c ~ d) + (a ~ b). Applying our definition
of addition of integers, we find that (a ~ b) +
(c ~ d) = (a + c ~ b + d), while (c ~ d) + (a ~ b) =
(c + a ~ d + b). But whole numbers obey the commutative
law for addition, so a + c = c + a, and b + d = d + b.
This shows that (a + c ~ b + d) and (c + a ~ d + b) 
are the same integer. Therefore, (a ~ b) + (c ~ d) =
(c ~ d) + (a ~ b), and the commutative law for addition 
of integers is true. The other four laws are proved by similar 
arguments, using the definitions of addition and multiplica- 
tion of integers, and the fact that whole numbers are known 
to obey the five laws. 
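As a companion to the proof, the laws can be spot-checked on a handful of integers. The sketch below assumes that the five laws are the two commutative laws, the two associative laws, and the distributive law; the list on pages 24-5 is not reproduced here, so that reading is an assumption. The helper holds and the sample range are our own choices.

```python
from itertools import product

def same_integer(d1, d2):
    (a, b), (c, d) = d1, d2
    return a + d == c + b

def add(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def mul(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a * c + b * d, a * d + b * c)

# All differences a ~ b with a and b below 4.
SAMPLES = list(product(range(4), repeat=2))

def holds(law, arity):
    """True if law(...) holds for every choice of sample differences."""
    return all(law(*args) for args in product(SAMPLES, repeat=arity))

commutative_add = holds(lambda x, y: same_integer(add(x, y), add(y, x)), 2)
commutative_mul = holds(lambda x, y: same_integer(mul(x, y), mul(y, x)), 2)
associative_add = holds(
    lambda x, y, z: same_integer(add(add(x, y), z), add(x, add(y, z))), 3)
associative_mul = holds(
    lambda x, y, z: same_integer(mul(mul(x, y), z), mul(x, mul(y, z))), 3)
distributive = holds(
    lambda x, y, z: same_integer(mul(x, add(y, z)),
                                 add(mul(x, y), mul(x, z))), 3)

print(commutative_add, commutative_mul, associative_add,
      associative_mul, distributive)
```

A check of this kind covers only the samples tried. The general proof still rests on the laws for whole numbers, as in the argument above.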

Zero and Unity 

Let us add the integer (0 ~ 0) to any other integer 
(a ~ b). Following the definition of addition, we find that 
(0 ~ 0) + (a ~ b) = (0 + a ~ 0 + b) = (a ~ b), since
0 + a = a, and 0 + b = b. In other words, when (0 ~ 0) 
is added to any other integer, it leaves that integer un- 
changed. Therefore (0 ~ 0) is a zero element for the system 
of integers. The integer (0 ~ 0), like any other integer, is 
a family of differences, and may be represented by any one 
of these differences. We can identify what these differences 
look like by using the criterion for belonging to an integer. 
The difference x ~ y belongs to the family of differences 
(0 ~ 0), if and only if 0 + y = x + 0, or y = x. That is,
a difference belongs to the integer (0 ~ 0) if and only if the
left number and the right number are equal. So (1 ~ 1),
(2 ~ 2), (3 ~ 3), and so on, are other ways of writing the 
zero element in the system of integers. 

Let us see how the integer (1 ~ 0) behaves under multi- 
plication. Following the definition of multiplication, we 
find (1 ~ 0) · (a ~ b) = (1 · a + 0 · b ~ 1 · b + 0 · a) =
(a + 0 ~ b + 0) = (a ~ b). In other words, when (1 ~ 0) 
is multiplied by any other integer, it leaves that integer un- 
changed. So (1 ~ 0) is a unity element for the system of 
integers. It may also be written in the form (a + 1 ~ a), 
where a is any whole number. This follows from the test for 
equality of integers, because (a + 1) + 0 = 1 + a.
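The behavior of the zero element and the unity element can be confirmed on a sample of integers. In this sketch the names ZERO and ONE, and the sample range, are our own.

```python
def same_integer(d1, d2):
    (a, b), (c, d) = d1, d2
    return a + d == c + b

def add(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def mul(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a * c + b * d, a * d + b * c)

ZERO = (0, 0)   # the integer (0 ~ 0)
ONE = (1, 0)    # the integer (1 ~ 0)

samples = [(a, b) for a in range(6) for b in range(6)]

# Adding ZERO or multiplying by ONE leaves every sample unchanged.
zero_ok = all(same_integer(add(ZERO, x), x) for x in samples)
one_ok = all(same_integer(mul(ONE, x), x) for x in samples)

# Other ways of writing the same two integers.
other_zero = same_integer((3, 3), ZERO)
other_one = same_integer((8, 7), ONE)

print(zero_ok, one_ok, other_zero, other_one)  # True True True True
```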


The Negative of an Integer 

Our purpose in constructing the system of integers was 
to find a number system in which an equation of the form 
X + B = A always has a solution. To show that we have
achieved our purpose, we have to introduce a new concept, 
the concept of the negative of a number. We say that one 
number is the negative of another if the sum of the two 
numbers is zero. In the system of whole numbers, there is 
only one number that has a negative. That number is 0, and
it is its own negative, because 0 + 0 = 0. No other whole 
number has a negative, because if a whole number that is 
different from 0 is added to any other whole number, the 
sum is different from 0. However, the system of integers is 
altogether different in this respect: In the system of integers,
every number has a negative. We prove this fact by
actually producing the negative of any integer. Let (a ~ b)
be any integer. Then (b ~ a) is its negative, because
(a ~ b) + (b ~ a) = (a + b ~ b + a) = the zero integer,
because, in the difference on display inside the parentheses, 
the left number is equal to the right number. Because every 
integer has a negative, we introduce a special symbol to 
mean “negative of.” The minus sign is used for this purpose.
If A stands for an integer, then —A is used to represent the 
negative of A. 

Now we are prepared to show that the equation 
X + B = A always has a solution, if X, A, and B stand for
integers. We have used capital letters here to represent 
integers, so that we would not confuse them with whole 
numbers. Each integer is a family of differences of whole 
numbers, so the equation can also be written in this form: 
(x ~ y) + (c ~ d) = (a ~ b). We solve the equation by
adding the negative of (c ~ d) to both sides. We get

(x ~ y) + (c ~ d) + (d ~ c) = (a ~ b) + (d ~ c).

But the sum of (c ~ d) and (d ~ c) is the zero integer. And
the zero integer added to (x ~ y) leaves it unchanged. So
we have (x ~ y) = (a ~ b) + (d ~ c) = (a + d ~ b + c).
For example, to solve (x ~ y) + (8 ~ 1) = (3 ~ 2), add
(1 ~ 8) to both sides. Then we get (x ~ y) = (4 ~ 10).
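The method of solution translates directly into a short routine. In the sketch below the names negative and solve are our own; solve(B, A) returns an X for which X + B = A, following the rule "add the negative of B to both sides."

```python
def same_integer(d1, d2):
    (a, b), (c, d) = d1, d2
    return a + d == c + b

def add(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def negative(d):
    """The negative of (a ~ b) is (b ~ a)."""
    a, b = d
    return (b, a)

def solve(B, A):
    """Return X such that X + B = A, namely X = A + (negative of B)."""
    return add(A, negative(B))

X = solve((8, 1), (3, 2))
print(X)                                        # (4, 10)
print(same_integer(add(X, (8, 1)), (3, 2)))     # True
```

The last line puts the solution back into the equation: (4 ~ 10) + (8 ~ 1) = (12 ~ 11), which is the same integer as (3 ~ 2) because 12 + 2 = 3 + 11.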

On page 44 we saw that solving X + B = A was another 
way of saying subtract B from A. Since, in the system of
integers, the equation always has a solution, it means that
subtraction is always possible in that system. In fact, our
method for solving the equation suggests an appropriate
definition for the subtraction of integers: To subtract an 
integer, means to add its negative. This definition makes 
sense in the system of integers, where every number has a 
negative. We could not have defined subtraction of whole 
numbers in the same way, because in the system of whole 
numbers, it is not true that every number has a negative. 

We Still Have the Whole Numbers 

Among the integers, there are some special integers like 
(0 ~ 0), (1 ~ 0), (2 ~ 0), (3 ~ 0), and so on, which are
written by putting on display a difference in which the 
number on the right is 0. These special integers are called 
non-negative integers. They can be placed in one-to-one cor- 
respondence with the whole numbers by pairing off 0 with 
(0 ~ 0), 1 with (1 ~ 0), 2 with (2 ~ 0), and so on. In general,
in this correspondence, each whole number a is matched
with the non-negative integer (a ~ 0). 

Now let us see what happens when we add or multiply 
any two non-negative integers, (a ~ 0) and (b ~ 0). For the
sum, we get (a ~ 0) + (b ~ 0) = (a + b ~ 0). So the sum of
two non-negative integers is a non-negative integer. For the
product, we get (a ~ 0) · (b ~ 0) = (a · b + 0 · 0 ~ a · 0
+ 0 · b) = (a · b ~ 0). So the product of two non-negative
integers is a non-negative integer. Moreover, the sum of
the integers matched with a and b is the integer matched
with a + b; and the product of the integers matched with a
and b is the integer matched with a · b. That is, under the
one-to-one correspondence the sum of the images of two
whole numbers is the image of their sum; and the product of
the images is the image of the product. 
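The two properties of the correspondence can be checked for a range of whole numbers. In this sketch the name embed, for the rule that matches a with (a ~ 0), is our own.

```python
def add(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def mul(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a * c + b * d, a * d + b * c)

def embed(n):
    """Match the whole number n with the non-negative integer (n ~ 0)."""
    return (n, 0)

whole_numbers = range(12)

# The sum of the images is the image of the sum.
sums_match = all(add(embed(a), embed(b)) == embed(a + b)
                 for a in whole_numbers for b in whole_numbers)
# The product of the images is the image of the product.
products_match = all(mul(embed(a), embed(b)) == embed(a * b)
                     for a in whole_numbers for b in whole_numbers)

print(sums_match, products_match)  # True True
```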

So the non-negative integers are isomorphic to the whole 
numbers. Because of the isomorphism, they can be used in- 
stead of the whole numbers, just as Roman numerals 
can be used instead of Arabic numerals. In this sense, we say 
that the non-negative integers are the “same” as the whole
numbers. We take advantage of the isomorphism by using 
the notation for whole numbers as an abbreviated notation
for the non-negative integers. In this abbreviated notation
0 represents (0 ~ 0), 1 represents (1 ~ 0), and so on.

Since the system of integers includes the non-negative in- 
tegers which are just “like” the whole numbers, they consti- 
tute an extension of the system of whole numbers. Using the 
system of integers instead of the system of whole numbers 
gives us a double advantage: We eliminate a defect of the 
system of whole numbers, without losing the whole numbers 
themselves. 

One of the non-negative integers is the integer (0 ~ 0) 
which is the zero element of the system of integers. If we 
remove this integer from the set of non-negative integers, the
subset that is left is called the set of positive integers. That 
is, every non-negative integer (a ~ 0) where a is not 0 is 
called a positive integer. Just as the non-negative integers 
are like whole numbers, the positive integers are like natural 
numbers or counting numbers, and can be used in their 
place. 

The Negative Integers 

Every integer can be represented by a difference in which 
either the left number or the right number is 0. This can be 
done by simply subtracting from each of these whole 
numbers the smaller of the two. The result will be a differ- 
ence that belongs to the integer, and so can be used to 
represent it. For example, the integer (8 ~ 3) is equal to 
(5 ~ 0). We know they are equal because 8 + 0 = 5 + 3. 
The integer (3 ~ 8) is equal to (0 ~ 5), because 3 + 5 = 
0 + 8. So every integer may be written in the form (a ~ 0)
or (0 ~ a). Those that can be written in the form (a ~ 0), 
where a is not 0, are the positive integers. Those that can be 
written in the form (0 ~ a), where a is not 0, are called nega- 
tive integers. Each of them is the negative of a positive in- 
teger. Using the minus sign as the symbol for “negative of,” 
we get an abbreviated notation for them, too, by writing
—a for (0 ~ a). 

This is the form in which students are introduced to them 
for the first time in courses in elementary algebra. The fa- 
miliar rules for calculating with these symbols can all be de- 
rived from our definitions for the addition and multiplica- 
tion of integers. For example, the rule that the product of 
two negative integers is a positive integer can be proved as 
follows: If neither a nor b is 0, (0 ~ a) · (0 ~ b) = (0 · 0
+ a · b ~ 0 · b + a · 0) = (a · b ~ 0), which is positive,
because a · b is not 0 when neither a nor b is 0.
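Both the reduction to the form (a ~ 0) or (0 ~ a) and the rule of signs can be seen at work in a few lines. In this sketch the names canonical and short_name are our own; short_name gives the abbreviated notation, a for (a ~ 0) and -a for (0 ~ a).

```python
def mul(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a * c + b * d, a * d + b * c)

def canonical(d):
    """Subtract the smaller number from both numbers of the difference,
    giving the form (a ~ 0) or (0 ~ a)."""
    a, b = d
    smaller = min(a, b)
    return (a - smaller, b - smaller)

def short_name(d):
    """Abbreviated notation: a for (a ~ 0), -a for (0 ~ a)."""
    a, b = canonical(d)
    return a if b == 0 else -b

print(canonical((8, 3)), canonical((3, 8)))   # (5, 0) (0, 5)
print(short_name((3, 8)))                     # -5
print(short_name(mul((0, 2), (0, 3))))        # 6
```

The last line is the product of the negative integers -2 and -3, which comes out as the positive integer 6.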

The positive integers, being essentially carbon copies of 
the natural numbers, can be represented as the natural 
numbers were on page 30, by points on the half-line that 
lies to the right of the 0. We can represent the negative 
numbers pictorially, too, by putting them on the other half 
of the line, on the other side of 0. Arranging them on a line 
suggests that we can talk about larger and smaller integers, 
just as we were able to talk about larger and smaller whole 
numbers. We give a meaning to the term larger as applied 
to integers by agreeing that one integer will be considered 
larger than another if it lies to the right of it on the line 
on which the integers are represented as points in the 
diagram below. The term can also be defined without refer- 
ence to the picture. If a and b are two different integers, 
we say that a is larger than b if a — b is positive. It is understood
here that a — b means a + the negative of b, in accordance
with the definition of subtraction of integers given
on page 52. On the basis of this definition, 2 is larger than
—7, because 2 — (—7) = 2 + 7 = 9, which is positive. The
integer —4 is larger than —5, because —4 — (—5) = —4 +
5 = 1, which is positive.

-5  -4  -3  -2  -1   0   1   2   3   4   5
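The definition of "larger" that does not refer to the picture can be written out as a test. In this sketch the name is_larger is our own. It relies on the observation that a difference a ~ b represents a positive integer exactly when a is greater than b as whole numbers, since only then can it be reduced to the form (c ~ 0) with c not 0.

```python
def add(d1, d2):
    (a, b), (c, d) = d1, d2
    return (a + c, b + d)

def negative(d):
    a, b = d
    return (b, a)

def is_larger(x, y):
    """x is larger than y if x + (negative of y) is positive."""
    a, b = add(x, negative(y))
    return a > b   # a ~ b is positive exactly when a > b

two = (2, 0)
minus_seven = (0, 7)
minus_four = (0, 4)
minus_five = (0, 5)

print(is_larger(two, minus_seven))        # True
print(is_larger(minus_four, minus_five))  # True
print(is_larger(minus_five, minus_four))  # False
```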
The Integers Form a Group

Before we proceed to any further extension of our num- 
ber system, let us stay with the system of integers for a 
while to observe some of its properties. We shall find within 
the system of integers examples of structures that form the 
typical subject matter of modern mathematics. 

We have two binary operations defined for the system of 
integers, addition and multiplication. Let us disregard mul- 
tiplication, and list some of the properties the system has in 
relation to the operation of addition alone. We observe 
these characteristics: 1) The operation of addition is associ- 
ative. 2) The system contains a zero element. 3) For every 
integer a in the system, its negative, —a, is also in the 
system. These characteristics make the system of integers 
an example of the type of structure that is known as a 
“group.” In fact, with a slight change in notation, intro- 
duced in order to give them a more general form, these 
characteristics make up the definition of a group. 

As we did once before on page 28, let us designate the 
binary operation by the symbol *. Zero is an example of an 
identity element, and if we designate it now by the letter 
e, the property that 0 + x = x + 0 = x can be written in
the more general notation as follows: e*x = x*e = x. 
Instead of the word “negative,” let us substitute the word 
“inverse,” and designate the inverse of a by a⁻¹. In this
notation, the fact that the sum of any integer and its negative
is zero takes this form: a * a⁻¹ = a⁻¹ * a = e.

Here, then, is the definition of a group : A group is a sys- 
tem of elements for which a binary operation * is defined, 
and which has these properties: 1) The operation * is asso- 
ciative. 2) The system contains an identity element e, with 
the property that, if x is any element in the system, e * x =
x * e = x. 3) For every element a in the system, its inverse
a⁻¹ is also in the system, with the property that a * a⁻¹ =
a⁻¹ * a = e. The word “negative” is used for an inverse
only in the special case where the operation is designated 
by + and is called “addition.” 
Another Example of a Group

The type of structure known as a group has been singled 
out for special study by mathematicians because it is found 
in many places. The system of integers is only one of a 
multitude of systems that have a group structure. It hap- 
pens to be a group that contains an infinite number of 
elements. But there are also groups that contain a finite 
number of elements. As an example of a finite group, let 
us examine the group of “symmetries” of an equilateral tri- 
angle. 

An equilateral triangle has equal sides and equal angles. 
The symmetries of the triangle are motions that bring it 
into coincidence with itself. To get acquainted with these 
motions, it is best to see them in action as applied to a 
model. So, before reading the paragraphs that follow, cut 
an equilateral triangle out of paper, and label its vertices 
A, B, and C, as shown in the diagram. Enter the labels on 
the reverse side of the paper, too, so that the vertices can be 
identified even when the triangle is turned over. 

Let us begin with the triangle placed on a level surface 
so that one side, say BC, is horizontal, and the opposite 
vertex A lies above BC, as shown in the diagram below. 



Now, as we discover motions that bring the triangle into 
coincidence with itself, we shall assign a name to each one. 
One motion that will qualify is a clockwise rotation of 120 
degrees around the center of the triangle. The result of 
such a rotation is to put B in the place of A, A in the place 
of C, and C in the place of B. Let us call this rotation P. 
Another motion that qualifies is a clockwise rotation of 240
degrees. It puts C in the place of A, B in the place of C, and
A in the place of B. Let us call it Q. Another “motion” that
qualifies is a clockwise rotation of 0 degrees. This, of course,
involves no movement of the triangle at all, and leaves A
in the place of A, B in the place of B and C in the place of
C. We shall call it I.

Motion                                  Symbol   First position   Final position
No motion                               I        A, B, C          A, B, C
Rotate 120° clockwise                   P        A, B, C          B, C, A
Rotate 240° clockwise                   Q        A, B, C          C, A, B
Flip over, keeping top vertex fixed     R        A, B, C          A, C, B
Flip over, keeping left vertex fixed    S        A, B, C          C, B, A
Flip over, keeping right vertex fixed   T        A, B, C          B, A, C

(Each position lists the vertices at the top, left, and right, in that order.)
Before we go any further, let us make an agreement. We
shall consider two motions as being the “same” if they have
the same effect. A clockwise rotation of 360 degrees has the
same effect as a rotation of 0 degrees, so it, too, will be
called I. A counterclockwise rotation of 120 degrees has the
same effect as a clockwise rotation of 240 degrees, so it, too, 
will be called Q. Similarly, a counterclockwise rotation of 
240 degrees is the same as P. 

There are three more motions that can bring the triangle 
into coincidence with itself. In one of them, we flip the tri- 
angle over to bring the bottom face up, while leaving the 
top vertex where it is. If we start with A at the top, B at the 
left, and C at the right, this motion puts A in its own place, 
and makes B and C change places. Let us call this motion 
R. A similar motion that keeps the vertex on the left fixed, 
while the top vertex and the vertex on the right change 
places, will be called S. One which keeps the vertex on the 
right fixed, while the top vertex and the vertex on the left 
change places, will be called T. 

We now have a system consisting of six elements, I, P,
Q, R, S, and T. We define a binary operation * for this sys- 
tem as follows: If A and B represent any two of these mo- 
tions, the product A * B is the motion that results when the 
two motions are performed one right after the other, with B
performed first, and A taking over where B leaves off. For
example, P * R means first perform R, then P. If we start 
with vertex A at the top, B at the left, and C at the right, 
motion R leaves A at the top, and makes B and C change 
places. Then, from this position, motion P rotates the tri- 
angle 120 degrees clockwise. As a result, A moves to the po- 
sition on the right, B moves to the position on the left, and 
C moves to the top. The effect is the same as if only the 
single motion S had been used. So P * R = S. If we pick 
any two of the six motions at random, and perform one after 
the other, we find that the result is always one of the 
original six motions. The results of performing the opera- 
tion * can be summarized in the multiplication table shown 
below, where the motion that is performed first, and is writ- 
ten on the right hand side in a product, is listed at the top 
of the table, and the motion that is performed second, and 
is written on the left in a product, is listed at the side. 

Multiplication Table
for Symmetries of the triangle

*   I   P   Q   R   S   T
I   I   P   Q   R   S   T
P   P   Q   I   S   T   R
Q   Q   I   P   T   R   S
R   R   T   S   I   Q   P
S   S   R   T   P   I   Q
T   T   S   R   Q   P   I
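Readers who would rather not flip a paper triangle thirty-six times can let a machine do it. The sketch below is one possible model, not the only one: each motion is recorded as a rule telling where the vertex in each position (top, left, right) ends up, as read off from the descriptions of the six motions. The names MOTIONS, compose, and TABLE are our own.

```python
from itertools import product

POSITIONS = ("top", "left", "right")

# Each motion sends the vertex in a given position to a new position.
MOTIONS = {
    "I": {"top": "top", "left": "left", "right": "right"},
    "P": {"top": "right", "right": "left", "left": "top"},   # 120° clockwise
    "Q": {"top": "left", "left": "right", "right": "top"},   # 240° clockwise
    "R": {"top": "top", "left": "right", "right": "left"},   # top fixed
    "S": {"left": "left", "top": "right", "right": "top"},   # left fixed
    "T": {"right": "right", "top": "left", "left": "top"},   # right fixed
}

def compose(a, b):
    """Name of the product a * b: perform b first, then a."""
    result = {pos: MOTIONS[a][MOTIONS[b][pos]] for pos in POSITIONS}
    return next(name for name, rule in MOTIONS.items() if rule == result)

NAMES = "IPQRST"
TABLE = {(a, b): compose(a, b) for a, b in product(NAMES, repeat=2)}

# Rows: the motion performed second. Columns: the motion performed first.
print("* " + " ".join(NAMES))
for a in NAMES:
    print(a + " " + " ".join(TABLE[a, b] for b in NAMES))
```

Printed this way, the rows agree with the multiplication table given in the text.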


To show that the symmetries of the triangle form a 
group, we have to prove that the three requirements for a 
group are satisfied. 1) The operation * is associative. That 
is, if A, B, and C are any three motions in the system, 
A * (B * C) = (A * B) * C. This can be verified from the
table, or from the fact that both symbols, A * (B * C), and
(A * B) * C require that C be performed first, B second,
and A third, so that both lead to the same result. The paren- 
theses here signify no more than a pause between motions. 
2) The system has an identity element. In fact I is the 
identity element, because the table shows that if X stands 
for any element, I*X = X*I = X. 3) For every element 
in the system, there is an inverse element that is also in 
the system. The inverse of P is Q, and the inverse of Q is P, 
because P*Q = Q*P = I. Each of the elements, I, R, S, 
and T is its own inverse, because I * I = I, R * R = I,
S * S = I, and T * T = I. Therefore the symmetries of the 
triangle, with the operation * as defined, form a group. 
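Since the system has only six elements, all three requirements can be verified from the table by exhaustive search. The sketch below copies the rows of the multiplication table as strings; the names ROWS and star are our own.

```python
from itertools import product

NAMES = "IPQRST"

# Rows of the multiplication table: ROWS[a] lists a * I, a * P, ..., a * T.
ROWS = {
    "I": "IPQRST",
    "P": "PQISTR",
    "Q": "QIPTRS",
    "R": "RTSIQP",
    "S": "SRTPIQ",
    "T": "TSRQPI",
}

def star(a, b):
    """Look up the product a * b in the table."""
    return ROWS[a][NAMES.index(b)]

# 1) The operation is associative: 6 x 6 x 6 = 216 cases.
associative = all(star(a, star(b, c)) == star(star(a, b), c)
                  for a, b, c in product(NAMES, repeat=3))

# 2) I is an identity element.
has_identity = all(star("I", x) == x == star(x, "I") for x in NAMES)

# 3) Every element has an inverse in the system.
inverses = {x: next(y for y in NAMES
                    if star(x, y) == "I" and star(y, x) == "I")
            for x in NAMES}

print(associative, has_identity)  # True True
print(inverses)
# {'I': 'I', 'P': 'Q', 'Q': 'P', 'R': 'R', 'S': 'S', 'T': 'T'}
```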

Not All Groups Are Commutative 

We see from the multiplication table for the group of 
symmetries that P * R = S, and that R * P = T. Therefore
P * R is not equal to R * P. The operation * in this group 
does not obey the commutative law. On the other hand, in 
the system of integers, which forms a group with the opera- 
tion +, the operation is commutative. A group which 
obeys the commutative law is called a commutative group. 
It is also called an abelian group in honor of one of the 
pioneers in the study of groups, Niels Henrik Abel, who died 
at the age of 26 in 1829. It is now the custom to use the
symbol + to represent the binary operation in a group only 
if it is a commutative or abelian group. If the plus sign is 
used to designate the operation, then the word zero is used 
instead of identity, and the word negative is used instead of 
inverse. 

Although the system of integers is the most commonly 
used system that has a group structure, it was not the first 
system to have its group structure analyzed. The first 
groups that were studied extensively were finite groups, 
like the group of symmetries of the triangle. They came up 
in the theory of equations, as studied by the methods of the 
French mathematician Evariste Galois. Galois recorded his 
findings in 1832 in a paper that he wrote the night before 
he was killed in a duel at the age of 20. After being
neglected for about thirty years, they were saved from
obscurity and extended by other mathematicians. Since then,
the concept of a group has invaded every branch of
mathematics.

The Integers Form a Ring 

To observe the group structure of the system of integers, 
we disregarded the operation of multiplication. Now, if we 
put multiplication back into the picture, we find in the 
system of integers an example of another structure that 
plays an important part in modern mathematics, a ring. A 
system of elements is called a ring if there are two binary 
operations defined for the system, and they have these 
properties: 1) Both operations are associative. 2) The sys- 
tem is an abelian group with respect to one of the opera- 
tions. This operation is designated by +, and is called addi- 
tion. 3) The other operation is distributive with respect to 
addition. If we call it multiplication, and represent it by the 
usual symbol for “times,” then the distributive law takes 
this form: a • (b + c) = a • b + a • c; and (b + c) • a =
b • a + c • a. It is necessary to state the distributive law
in two parts in this way, because the multiplication opera- 
tion need not be commutative, so multiplication by a from 
the left is not the same as multiplication by a from the 
right. 

The fact that the system of integers is a ring follows 
from the properties of the system that we have already seen. 

Multiplication by Zero 

One of the familiar rules of arithmetic that we use every 
day when we work with natural numbers is that zero times 
any number gives a product equal to zero. This rule turns 
out to be true in the system of integers as well. In fact, we 
can show that it has to be true in any system that has a ring 
structure. In any ring, if x is any element, and 0 is its zero
element, x + 0 = x. Now multiply both sides of this equation
by any other element in the ring, say y. Then we have
y • (x + 0) = y • x. By the distributive law in the ring, we
can replace y • (x + 0) by y • x + y • 0, so we get y • x +
y • 0 = y • x. Since the ring is a group with respect to
addition, the element y • x has a negative, -(y • x). Let us add
this negative to both sides of the equation. We get
-(y • x) + y • x + y • 0 = -(y • x) + y • x. But
-(y • x) + y • x = 0, from the definition of a negative. So
we now have 0 + y • 0 = 0. But since 0 is a zero element
for addition, 0 + y • 0 can be replaced by y • 0. This leads
to the conclusion that y • 0 = 0.

Hidden Groups and Rings 

The group structure and ring structure that we have ob- 
served so far in the system of integers lie on the surface, 
we might say, because they embrace the system as a whole. 
However, there are more group and ring structures hidden 
within the system of integers as substructures. We shall 
produce some examples for examination. 

Suppose we divide the integer 6 by 3. The quotient is 
2, which is also an integer. Then 6 can be written as 2 • 3. 
There are other integers that can also be written as some 
integer times 3. For example, -9 = -3 • 3. All such
integers are called integral multiples of 3. The integral
multiples of 3 form the set {0, 3, -3, 6, -6, 9, -9, 12, -12,
. . .}. The dots inside the braces indicate that there are
many more members of the set besides those that are listed.
A characteristic of all integral multiples of 3 is that when 
we divide them by 3, the remainder is 0. 

The integers which are not multiples of 3 can be divided 
into two families. One family consists of those that are 1 
more than an integral multiple of 3. For example, 7 = 
6 + 1, and —5 = —6 + 1, so 7 and —5 both belong to this 
family. Members of this family have the characteristic that 
when we divide them by 3, the remainder is 1. The other 
family consists of those integers that are 2 more than an 
integral multiple of 3. For example, 8 = 6 + 2, and —4 = 
—6 + 2, so 8 and —4 belong to this family. Members of this 
family have the characteristic that when we divide them by
3, the remainder is 2. All integers therefore fall into one of 
three classes, depending on whether the remainder, when we 
divide by 3, is 0, 1, or 2. We call these classes residue classes 
modulo 3. For convenience in talking about them, we shall 
designate each class by the remainder which is characteristic 
of it. Here, then, are the three classes: 


    0 class:  {  0,  3,  6,   9,  12, . . .
                -3, -6, -9, -12, . . . }

    1 class:  {  1,  4,  7,  10,  13, . . .
                -2, -5, -8, -11, -14, . . . }

    2 class:  {  2,  5,  8,  11,  14, . . .
                -1, -4, -7, -10, -13, . . . }
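
The sorting of integers into residue classes can be sketched in a few lines. The names residue_class and classes are our own, and the sketch relies on the fact that Python's % operator returns a remainder that is never negative when the divisor is positive, which matches the remainders 0, 1, and 2 used here.

```python
def residue_class(n, modulus=3):
    """Name the residue class of n by its remainder on division by modulus."""
    return n % modulus  # never negative when modulus is positive

# Sort a small sample of integers into the three classes.
classes = {r: [] for r in range(3)}
for n in range(-14, 15):
    classes[residue_class(n)].append(n)

for r, members in classes.items():
    print(r, "class:", members)
```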


The 0 class has some interesting properties. Notice first 
that if we add any two members of the 0 class, the sum is 
itself a member of the 0 class. For example, 3 + (—6) = 
—3. This is not true of the other two classes. For example, 
although 1 and 4 belong to the 1 class, their sum, 1 + 4 = 5,
is not in the 1 class. Similarly, although 2 and 5 belong
to the 2 class, their sum, 2 + 5 = 7, is not in the
2 class. Therefore, addition is a binary operation for the 0
class, but not for the 1 class or the 2 class. (Recall that a 
binary operation is defined for a set only by a mapping that 
associates with each pair of elements in the set another 
element in the same set.) Since addition is associative for 
the whole system of integers, it is surely associative when 
used with the members of the 0 class alone. 

Notice, secondly, that the 0 class contains the identity 
element for addition, namely, 0. Moreover, for every ele- 
ment in the 0 class, its negative also belongs to the class. 
Therefore the 0 class satisfies all the requirements for being 
a group in its own right. In fact, since addition is commu- 
tative, it is even an abelian group. Because it is a group 
that is a subset of the larger group of integers, it is called 
a subgroup of the group of integers. The other two residue 
classes modulo 3 cannot qualify as subgroups with respect 
to addition, because, as we have seen, addition is not even
a binary operation for them. 

We can say even more about the 0 class. If we multiply
any two members of the 0 class, the product is itself a
member of the 0 class, so multiplication is also a binary
operation for this class. Moreover, it is associative and
obeys the distributive law, because these laws hold for all
integers. So the 0 class satisfies all the requirements for
being a ring in its own right. It is called a subring of the
ring of integers.

We can say still more about the 0 class. If we multiply 
any member of the 0 class by any integer, whether it is in 
the class or not, the product is a member of the 0 class. For 
example, 4 • 3 = 12, which is in the 0 class, although 4 is
not. A subring which has this property is called an ideal.
So the 0 class modulo 3 is an ideal in the ring of integers.

The 0 class, considered as a ring, differs in one very
important way from the ring of integers. It does not contain
the number 1, which is the unity element for multiplication.
So we see it is possible for a ring to be with or without
a unity element.
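
These closure properties can be spot-checked on a finite sample of integers. This is a sketch under an obvious assumption: a computer can only test a window of the integers (here -30 to 30, our own choice), so the check illustrates the claims rather than proving them. The names WINDOW, in_class, and closed_under are hypothetical.

```python
WINDOW = range(-30, 31)  # finite sample standing in for the integers

def in_class(n, r, modulus=3):
    """True if n belongs to the r class modulo `modulus`."""
    return n % modulus == r

def closed_under(op, r):
    """True if op(a, b) stays in the r class for all sampled a, b in it."""
    members = [n for n in WINDOW if in_class(n, r)]
    return all(in_class(op(a, b), r) for a in members for b in members)

def add(a, b):
    return a + b

def mul(a, b):
    return a * b

# Ideal property: any integer times a member of the 0 class is in the 0 class.
absorbs = all(
    in_class(k * m, 0) for k in WINDOW for m in WINDOW if in_class(m, 0)
)

print(closed_under(add, 0), closed_under(add, 1), closed_under(add, 2))
print(closed_under(mul, 0), absorbs, in_class(1, 0))
```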


The Arithmetic of Residue Classes 

We are now going to construct a new number system 
whose elements are the residue classes modulo 3. To do so
we must define what we mean by addition and multiplica- 
tion of these classes. We define addition as follows: To add 
two residue classes, pick one member from each class, and 
add these members. The class to which the sum belongs will 
be called the sum of those residue classes. For example, to add 
the 1 class and the 2 class, we might add 1 (chosen from the 
1 class) to 2 (chosen from the 2 class). The sum is 3, which 
belongs to the 0 class. Therefore the 1 class plus the 2 
class equals the 0 class. Our definition allows us a free choice 
in picking the numbers in each class that we will add. How- 
ever, it makes no difference which ones we choose, because 
all members of the same class have the same remainder
when divided by 3. Adding a representative from each of 
two classes is like adding their remainders, and the result
is the same no matter which representative is chosen. 

We can prove this fact for the illustration being given 
in this way: Any number in the 1 class is of the form 
3a + 1. Any number in the 2 class is of the form 3b + 2. 
When we add them, we get 3a + 1 + 3b + 2 = 3a + 3b + 
3 = 3(a + b + 1), a multiple of 3, which is in the 0
class. For the sake of brevity, let us now drop the word class 
from the name of each class, and simply refer to them as 
0, 1, and 2. Then we get the following addition table for 
residue classes modulo 3: 


     + | 0  1  2
    ---+---------
     0 | 0  1  2
     1 | 1  2  0
     2 | 2  0  1


We define multiplication in the same way: To find the 
product of two residue classes, multiply any member of one 
class by any member of the other. The class to which the 
product belongs is the product of the classes. For example, 
to multiply the 2 class by the 2 class, multiply 2 (chosen 
from the 2 class) by 5 (chosen from the 2 class). The 
product is 10, which belongs to the 1 class. So the 2 
class times the 2 class equals the 1 class. As in the case 
of addition, it makes no difference which member of 
each class is chosen for carrying out the operation. Here, 
too, we drop the word class from the name of each class, 
and record the products in the following multiplication 
table for residue classes modulo 3 : 
     • | 0  1  2
    ---+---------
     0 | 0  0  0
     1 | 0  1  2
     2 | 0  2  1
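
Both tables, and the claim that the choice of representative makes no difference, can be reproduced with a short sketch. The function names table and well_defined are our own, and the representatives are drawn from a small range on either side of zero, so the second check is a sample rather than a proof.

```python
def table(op, modulus):
    """Build the operation table for residue classes named 0 .. modulus - 1."""
    return [[op(a, b) % modulus for b in range(modulus)] for a in range(modulus)]

def well_defined(op, modulus, span=4):
    """Check that the class of op(x, y) depends only on the classes of x and y."""
    def reps(r):
        # A few members of the r class: r, r + modulus, r - modulus, ...
        return [r + modulus * k for k in range(-span, span + 1)]
    return all(
        op(x, y) % modulus == op(a, b) % modulus
        for a in range(modulus)
        for b in range(modulus)
        for x in reps(a)
        for y in reps(b)
    )

def add(a, b):
    return a + b

def mul(a, b):
    return a * b

add_table = table(add, 3)
mul_table = table(mul, 3)
print(add_table, mul_table)
print(well_defined(add, 3), well_defined(mul, 3))
```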


With these tables for addition and multiplication, the 
residue classes modulo 3 obey the five laws. Therefore they 
make up a number system consisting of only 3 elements. 
This number system is also both a group and a ring. It is 
a group with respect to addition, because 1) it contains a 
zero element, viz., the 0 class, and 2) for every element in 
the system, there is a negative in the system, too. The 0 
class is a zero element, because, as the addition table shows, 
that class added to any other class leaves it unchanged:
0 + 0 = 0, 0 + 1 = 1, and 0 + 2 = 2. The negative of 0 is
0. The negative of 1 is 2, and vice versa, because 1 + 2 =
2 + 1 = 0. The group is abelian, because the addition
operation is commutative. The system is also a ring, because,
besides being an abelian group, it has a multiplication
operation which is distributive with respect to addition. For
example, 2 • (1 + 2) = 2 • 0 = 0, and 2 • 1 + 2 • 2 = 2 + 1 = 0.
Therefore 2 • (1 + 2) = 2 • 1 + 2 • 2.

Quotient Groups 

The group that we have just observed contained as its 
elements a subgroup of the system of integers (the 0 class) 
and certain companion sets, the 1 class and the 2 class.
The companion sets can be obtained from the subgroup
by adding to each member of the subgroup some element
that is not in it. For example, by adding 1 to each of the
elements in the 0 class, we get all the elements of the 1
class. In fact, if we add any single member of the 1 class
to each of the elements in the 0 class, we get all the
members of the 1 class. Similarly, if we add any member of the
2 class to each of the elements of the 0 class, we get all
the members of the 2 class. Companion classes that have 
this relationship to a subgroup are called its cosets. When 
a subgroup and its cosets are themselves elements of a 
group, the group they form is called a quotient group. 

If we look back to the table for multiplication of the
symmetries of a triangle (page 59), we shall find another
example of a subgroup, its cosets, and a quotient group that
they form. Here, of course, the group operation will be *
instead of +. Notice, first, that when I, P, and Q are
multiplied among themselves, the product is always I, or P, or Q.
Therefore the operation * is a binary operation for the
subset {I, P, Q}. The identity element I is in this subset, and
the inverse of every element in the subset is also in the
subset. In fact, the inverse of I is I, the inverse of P is Q,
and the inverse of Q is P. Therefore the subset {I, P, Q}
forms a subgroup with the operation *. Its coset is the set
{R, S, T}.

Now let us assign a short name to each of these sets, by
writing G = {I, P, Q}, and H = {R, S, T}. We can define
an operation * on the system whose elements are G and H
as follows. To multiply one set by another, pick an element
from each set, and multiply them. For example, take P
(from G) * S (from H). The product is T, which is in H.
The set to which the product belongs will be called the
product of the sets. Therefore G * H = H. As in the case of
residue classes, it turns out that the product comes out the
same no matter which representative we choose from each
class to carry out the operation. The products we get can
be summarized in this multiplication table:


     * | G  H
    ---+------
     G | G  H
     H | H  G

In this system of two elements, G is an identity element for
the operation *, because G * G = G, and G * H = H *
G = H. Also, there is an inverse for every element. In fact,
each element is its own inverse, because G * G = G, and
H * H = G, which is the identity element. Therefore the
subgroup G and the coset H form a quotient group of the
group of symmetries of the triangle. In this case, we can see
why it is called a quotient group. The group of symmetries
has 6 elements, the subgroup G has 3 elements, and the
quotient group has 6 ÷ 3 = 2 elements. Not every subgroup,
together with its cosets, will form a quotient group.
Galois was the first to identify the special kind of subgroups 
that have this property. They are known as normal sub- 
groups. In an abelian group all subgroups are normal. 
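
The difference between a subgroup that yields a quotient group and one that does not can be seen by direct computation. The sketch below is illustrative only: the helper names are our own, the cosets are formed by multiplying on the left (one of two possible conventions), and the subgroup {I, R} is our own choice of a counterexample.

```python
ELEMENTS = "IPQRST"
# Rows of the symmetry table: left factor indexes the row.
ROWS = {
    "I": "IPQRST",
    "P": "PQISTR",
    "Q": "QIPTRS",
    "R": "RTSIQP",
    "S": "SRTPIQ",
    "T": "TSRQPI",
}

def star(x, y):
    """Return x * y from the multiplication table."""
    return ROWS[x][ELEMENTS.index(y)]

def left_coset(x, subgroup):
    """The set obtained by multiplying every member of the subgroup by x."""
    return frozenset(star(x, h) for h in subgroup)

def cosets(subgroup):
    return {left_coset(x, subgroup) for x in ELEMENTS}

def product_is_well_defined(subgroup):
    """True if the coset of a * b depends only on the cosets of a and b."""
    cs = cosets(subgroup)
    for A in cs:
        for B in cs:
            targets = {left_coset(star(a, b), subgroup) for a in A for b in B}
            if len(targets) != 1:
                return False
    return True

G = frozenset("IPQ")
print(cosets(G))
print(product_is_well_defined(G))                # the subgroup {I, P, Q}
print(product_is_well_defined(frozenset("IR")))  # the subgroup {I, R}
```

For {I, R} the product of two cosets lands in different cosets depending on the representatives chosen, so no quotient group arises from it.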

We saw that the residue classes modulo 3 form a ring as 
well as a group. It is an example of a quotient ring. Just as 
in a group, a special kind of subgroup with cosets forms a 
quotient group, in a ring, a special kind of ideal with its 
cosets forms a quotient ring. In a ring in which the
multiplication is commutative, all ideals have this property.


Residue Classes Modulo 6 

We formed the residue classes modulo 3 by classifying 
the integers by the remainder you get when you divide by 3, 
using for this purpose only positive remainders that are less 
than 3, viz., 0, 1 and 2. If we use any other integer as divisor, 
we can divide the system of integers into residue classes 
modulo that integer in the same way. For example, when an 
integer is divided by 6, the remainder may be 0, 1, 2, 3, 4, 
or 5. This gives us six residue classes modulo 6. As we did 
in the case of residue classes modulo 3, let us use the re- 
mainder associated with each class as the name of the 
class. If we define addition and multiplication of residue 
classes the same way we did before, we get these addition 
and multiplication tables:

     + | 0  1  2  3  4  5          • | 0  1  2  3  4  5
    ---+------------------        ---+------------------
     0 | 0  1  2  3  4  5          0 | 0  0  0  0  0  0
     1 | 1  2  3  4  5  0          1 | 0  1  2  3  4  5
     2 | 2  3  4  5  0  1          2 | 0  2  4  0  2  4
     3 | 3  4  5  0  1  2          3 | 0  3  0  3  0  3
     4 | 4  5  0  1  2  3          4 | 0  4  2  0  4  2
     5 | 5  0  1  2  3  4          5 | 0  5  4  3  2  1

With these tables, the residue classes modulo 6 form a
ring. This ring has a peculiar feature that it does not share 
with the ring of integers. In the ring of integers, the only 
time a product of two elements equals 0 is when one of the 
multipliers is itself 0. On the other hand, in the ring of 
residue classes modulo 6, 2 • 3 = 0, and neither 2 nor 3 is
itself zero. Non-zero elements, like 2 and 3, which when 
multiplied give a zero product are called zero divisors. The 
ring of integers has no zero divisors, but the ring of residue 
classes modulo 6 does have zero divisors. 
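
A short search makes the contrast concrete. The function name zero_divisors is our own; it simply tries every pair of non-zero residue classes and records those whose product is the 0 class.

```python
def zero_divisors(modulus):
    """Non-zero residue classes that give 0 when multiplied by some non-zero class."""
    return sorted(
        {
            a
            for a in range(1, modulus)
            for b in range(1, modulus)
            if (a * b) % modulus == 0
        }
    )

print(zero_divisors(6))  # residue classes modulo 6
print(zero_divisors(3))  # residue classes modulo 3
```

Modulo 6 the search finds 4 as well as 2 and 3, because 3 • 4 = 12 belongs to the 0 class.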

The absence of zero divisors in the ring of integers is
the basis for one of the very important rules we all learned
in elementary high school algebra, the cancellation law of
multiplication. We learned that if 2 • x = 2 • 3, we may
cancel the 2 on both sides of the equation, and conclude that
x = 3. The argument that proves this is correct proceeds as
follows: Add the negative of 2 • 3 to both sides of the
equation. This gives us 2 • x - 2 • 3 = 0. By using the
distributive law, we get 2 • x - 2 • 3 = 2 • (x - 3). Therefore
2 • (x - 3) = 0. This statement tells us that the product of two
integers equals 0. This can happen only if one of the
multipliers is 0. Since 2 is not 0, the other multiplier, x - 3,
must be 0. Therefore x has to be 3. In the ring of residue
classes modulo 6, this whole argument breaks down, because
there are zero divisors in the system. That is, a product can
be equal to 0, without either of the multipliers being equal to
0. Consequently, the cancellation law of multiplication is
not obeyed in this system. In fact, in this system, if 2 • x =
2, we cannot conclude that x = 1. The multiplication table
shows that while x = 1 is one possible solution to the
equation, x = 4 is another solution, because 2 • 4 = 2.
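
The failure of cancellation can be confirmed by listing every solution of an equation a • x = c among the residue classes. The function solutions is a hypothetical helper that tries each class in turn.

```python
def solutions(a, c, modulus):
    """All residue classes x with a * x = c in the ring of classes mod `modulus`."""
    return [x for x in range(modulus) if (a * x) % modulus == c % modulus]

print(solutions(2, 2, 6))  # 2 * x = 2 among residue classes modulo 6
print(solutions(2, 2, 3))  # the same equation among residue classes modulo 3
```

Modulo 3, where there are no zero divisors, the equation has a single solution and cancellation is safe.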

Mapping Group into Group 

We can set up a mapping of the group of integers into 
the group of residue classes modulo 3 by assigning to each 
integer in the system of integers the residue class that it 
belongs to, as shown in the table: 

    0 -> 0
    1 -> 1
    2 -> 2
    3 -> 0
    4 -> 1
    5 -> 2
    6 -> 0

By this mapping, 0, 3, 6, 9, and 12, for example, have the 0 
class as their image; 1, 4, 7, 10, and 13 have the 1 class 
as their image; and 2, 5, 8, 11, and 14 have the 2 class as their
image. The mapping is clearly a many-to-one mapping. It 
has the interesting property of preserving the operation 
that is defined for the group. That is, the image of a sum 
is the sum of the images. For example, the image of 3 is 0, 
and the image of 4 is 1. The sum of 3 and 4 is 7, and its 
image is 1, which is the sum of the images 0 and 1. A map- 
ping like this, of one group into another, which preserves 
the group operation is called a group homomorphism. 
Where it is a many-to-one mapping it is like collapsing or 
telescoping the group to make it fit into a smaller one. If we 
consider the system of integers and the system of residue 
classes modulo 3 as rings, the same mapping preserves not 
only the group operation of addition, but also the other 
ring operation, multiplication. For this reason, it is also an 
example of a ring homomorphism (one which preserves the
operations in the ring). 
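
The statement that the image of a sum is the sum of the images can be tested on a sample of integers. In this sketch the name image and the sample range are our own, and addition and multiplication of classes are carried out on their names 0, 1, 2 followed by reduction modulo 3.

```python
def image(n, modulus=3):
    """The residue class (named by its remainder) assigned to the integer n."""
    return n % modulus

SAMPLE = range(-20, 21)  # finite sample of the integers

# Image of a sum equals the sum of the images (group homomorphism).
preserves_sum = all(
    image(a + b) == (image(a) + image(b)) % 3 for a in SAMPLE for b in SAMPLE
)

# Image of a product equals the product of the images (ring homomorphism).
preserves_product = all(
    image(a * b) == (image(a) * image(b)) % 3 for a in SAMPLE for b in SAMPLE
)

print(preserves_sum, preserves_product)
```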

When a group homomorphism of one group into another 
is a one-to-one correspondence, then it is a group isomor- 
phism. Each element in one group is then paired off with one 
and only one element in the other. In that case, we say 
that the two groups are isomorphic to each other, or have 
the same structure. Two groups that are isomorphic to 
each other are really the same group structure dressed up 
in different clothes. We have an example of isomorphic 
groups in the group of residue classes modulo 3, and the sub- 
group {I, P, Q} of the group of symmetries of the triangle.
A mapping which matches them one-to-one, and preserves 
the group operation is shown below: 

    0 <-> I
    1 <-> P
    2 <-> Q
    + <-> *

If we take any true statement in one system, such as 
1 + 2 = 0, and replace each symbol by its image under the 
mapping, we get a true statement in the other system. In 
the example shown, we get P * Q = I. In other words, the 
symbols used in the two systems are like two different 
languages that may be used for expressing the same ideas. 
The isomorphism printed above is the dictionary that al- 
lows us to translate from one language into the other. 
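
Whether this dictionary really translates every true statement into a true statement can be checked for all nine sums. The sketch copies the {I, P, Q} part of the symmetry table into a lookup named STAR; that name and DICTIONARY are our own.

```python
# The {I, P, Q} part of the symmetry table; keys are (left, right) factors.
STAR = {
    ("I", "I"): "I", ("I", "P"): "P", ("I", "Q"): "Q",
    ("P", "I"): "P", ("P", "P"): "Q", ("P", "Q"): "I",
    ("Q", "I"): "Q", ("Q", "P"): "I", ("Q", "Q"): "P",
}

# The pairing of residue classes modulo 3 with symmetries.
DICTIONARY = {0: "I", 1: "P", 2: "Q"}

one_to_one = len(set(DICTIONARY.values())) == len(DICTIONARY)

# Translate a + b = c into image(a) * image(b) = image(c) for every pair.
preserves_operation = all(
    DICTIONARY[(a + b) % 3] == STAR[DICTIONARY[a], DICTIONARY[b]]
    for a in range(3)
    for b in range(3)
)

print(one_to_one and preserves_operation)
```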

DO IT YOURSELF 

1. Using the definition of multiplication of integers given 
on page 48, prove that multiplication of integers is 
commutative by showing that 

(a ~ b) • (c ~ d) = (c ~ d) • (a ~ b).

2. Using (0 ~ a) and (0 ~ b) where a and b are not zero
to represent any two negative integers, prove that the 
sum of two negative integers is a negative integer. 

3. Using (0 ~ a) with a not zero to represent any negative
integer, and (b ~ 0) with b not zero to represent
any positive integer, prove that the product of a
negative integer and a positive integer is a negative integer.

4. A rearrangement of the numbers 1, 2, 3 is called a
permutation of these numbers. Each rearrangement
has the effect of replacing one number by another.
For example, if the arrangement 123 is changed to
312, 1 is replaced by 3, 2 is replaced by 1, and 3 is
replaced by 2. This permutation can be represented
as a mapping:

    1 -> 3
    2 -> 1
    3 -> 2

There are six possible permutations of three numbers. 
Call them I, A, B, C, D, E as follows: 

       I            A            B
    1 -> 1       1 -> 2       1 -> 3
    2 -> 2       2 -> 3       2 -> 1
    3 -> 3       3 -> 1       3 -> 2

       C            D            E
    1 -> 1       1 -> 3       1 -> 2
    2 -> 3       2 -> 2       2 -> 1
    3 -> 2       3 -> 1       3 -> 3

Define the product of two permutations X * Y as
the result of performing Y first and X afterwards on
the result of Y. How a product is identified is shown
in the following example: To find A * B:

      B:    A:                  I:
    1 -> 3 -> 1               1 -> 1
    2 -> 1 -> 2     Result:   2 -> 2
    3 -> 2 -> 3               3 -> 3

Therefore A * B = I.

a) Construct the multiplication table for the per- 
mutations I, A, B, C, D, E. 

b) Prove that they form a group with the opera- 
tion *. 

c) Identify the subgroups of this group.

d) Show that this group of permutations is iso- 
morphic to the group of symmetries of the tri- 
angle. 

5. Use the associated remainders, 0, 1, 2, 3, 4 as the names 
of the residue classes modulo 5. Use the definitions of 
addition and multiplication given on pages 64-5 to con- 
struct addition and multiplication tables for these 
residue classes. 

Use the tables to verify that 2 • (1 + 2) = 2 • 1 +
2 • 2 in this system. Does this system have zero
divisors? 

6. Show that the set of all even integers is a subgroup 
of the group of integers with respect to the operation 
+. Show that it is a subring of the ring of integers. 
Show that it is an ideal of the ring of integers. (See 
definitions, pages 63-4.) 
CHAPTER IV


Numbers for Measuring 


Another Defect to Overcome 

THE system of whole numbers has the defect that 
subtraction is not always possible in that system. To over- 
come this defect, we constructed the system of integers, an 
enlarged number system that includes the whole numbers, 
and in which subtraction is always possible. In this chapter 
we undertake another extension of the number system for 
a similar purpose. We find that the system of integers has 
the defect that division is not always possible within the 
system. To overcome this defect, we shall construct an en- 
larged number system that includes the integers, and in 
which division is (almost) always possible. The word 
“almost” has to be included in the statement of our goal, 
because, in the enlarged system, there will still be one 
number whose use as a divisor will be forbidden. 

Just as subtraction of whole numbers was defined in
terms of addition, division of integers may be defined in
terms of multiplication. The symbol -6/2 really asks us the
question, “What integer multiplied by 2 gives -6 as the
product?” Since the answer to the question is -3, we say
-6/2 = -3. We call the symbol -6/2 the quotient of -6 and
2. We also refer to it as a fraction, and in this case it has
meaning as another symbol for the integer -3. The question
asked by the fraction can also be written in the form of
an equation, 2 • x = -6, and the answer to the question is
the solution to the equation.

However, some fractions ask us a question that we cannot
answer in the system of integers. For example, the fraction
2/3 asks the question, “What number, when multiplied by 3,
gives 2 as the product?” In the system of integers there isn’t
any such number. So, in the system of integers, the fraction
2/3 has no meaning, and the equation 3 • x = 2 has no
solution. This situation offers us a challenge similar to the one
we faced in the last chapter. Can we build a number system
in which division is always possible for any pair of numbers,
so that a/b always has a meaning, and b • x = a always has a
solution?

Zero Is an Exception 

We find that we can, provided that we agree not to use
zero as a divisor. We can see the reason for this exclusion if
we try to answer the question that is asked by a fraction that
has 0 in the denominator. If the numerator is not 0, as in the
fraction 2/0, the fraction asks, “What number multiplied by 0
gives 2 as the product?” We hope to make the enlarged
number system a ring. And in a ring, as we saw on page 61,
zero times any other number gives a product equal to zero.
Then the answer to the question will have to be “No
number.” On the other hand, if the numerator of the fraction
is 0, the situation is even worse. Then the fraction asks,
“What number multiplied by 0 will give 0 as a product?”
The answer to this question would be, “any number.” In
fact, even in the system of integers, this would be the answer.
We would like the fractions in our expanded number system
not to overreach themselves. We want each fraction to stand
for one and only one number. Since the fraction 2/0 stands for
no number, and the fraction 0/0 stands for too many numbers,
we exclude them as legitimate fractions. So, from now on,
whenever we talk about a fraction, it will be understood
that the denominator may not be 0.


Families of Fractions 

To construct the new number system, we use the same
device that was employed in the last chapter. To be sure
that the question asked by the fraction a/b (where b is not 0)
always has an answer, we shall let the question be its own
answer. We shall let each fraction represent a number in
the new system. However, the numbers in this system will
not be single fractions. Just as each number in the system
of integers is a family of differences of whole numbers, each
number in the system we are now constructing will be a
family of fractions or quotients of integers. This is made
necessary by the fact that, even among fractions that have a
meaning in the system of integers, many fractions can
represent the same number. For example, 3/1, 6/2, 9/3, 12/4,
and many others represent the number 3, so we shall have to
put them together into one family. In fact, we can obtain from
this example the criterion we shall use for deciding when two
fractions belong to the same family. Notice that 6/2 and 9/3,
which belong to the same family, have the property that the
numerator of each times the denominator of the other gives
the same product, that is, 6 • 3 = 9 • 2.

We are now ready to start building up the new number
system. First, we take all possible ordered pairs of integers,
such as 6 and 2, 5 and 7, -2 and 3, or -3 and -9, in which
the second number in the pair is not zero. Then we write
the “quotients” of the numbers in the pair, using the first
number as numerator. So we now have a collection of
fractions or quotients like 6/2, 5/7, -2/3, and -3/-9. Next, we
associate with each fraction a family of fractions according to
the following rule: The family of fractions belonging to a/b,
where b is not zero, consists of all those fractions u/v for
which a • v = u • b. To designate the family, we shall write
the fraction a/b inside parentheses. Thus, (a/b) means the
family of fractions belonging to a/b. We call such a family of
fractions a rational number.

The rational numbers have some properties analogous to
those of integers. First, a fraction belongs to its own family.
Secondly, if one of two fractions belongs to the family of
the other, then they have the same family. Because of these
properties, each fraction belongs to one and only one rational
number. A rational number can be represented by putting
on display inside parentheses any one of the fractions that
belongs to it. For example, since 9/3 belongs to the rational
number (6/2), that number can also be represented as (9/3).
Finally, the criterion for membership in a rational number
also serves as a test for equality of rational numbers. That is,
the rational numbers (a/b) and (c/d) are equal if and only if
a • d = c • b. For example, we know that (6/2) = (9/3),
because 6 • 3 = 9 • 2.
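
The membership test translates directly into a one-line comparison. In the sketch below, same_family is our own name for the test; the comparison with Fraction from Python's standard fractions module is included only as an independent cross-check, since that class reduces fractions to lowest terms rather than cross-multiplying.

```python
from fractions import Fraction

def same_family(a, b, c, d):
    """Test whether the fractions a/b and c/d belong to the same rational number."""
    if b == 0 or d == 0:
        raise ValueError("a fraction may not have 0 as its denominator")
    return a * d == c * b

print(same_family(6, 2, 9, 3))    # 6/2 and 9/3
print(same_family(-3, -9, 1, 3))  # -3/-9 and 1/3
print(same_family(2, 3, 3, 5))    # 2/3 and 3/5

# Cross-check against the standard library's exact rational type.
print(Fraction(6, 2) == Fraction(9, 3))
```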

Addition and Multiplication of Rational Numbers 

Addition and multiplication of rational numbers are
defined by the following equations:

    (a/b) + (c/d) = ((a • d + b • c)/(b • d))

    (a/b) • (c/d) = ((a • c)/(b • d))

To carry out the addition or multiplication of rational
numbers, we apply these definitions to whatever fractions
are on display inside the parentheses to represent the rational
numbers. Since each rational number may be represented
by any one of its members, the addition or multiplication
may be carried out in many ways. The definitions make
sense only if the result comes out the same no matter which
member of a rational number is used to represent it. This
turns out to be so. For example, applying the definition of
addition to (2/3) and (3/5), we obtain as their sum the
rational number ((2 • 5 + 3 • 3)/(3 • 5)) = (19/15). But
(2/3) = (4/6), since 2 • 6 = 4 • 3; and (3/5) = (12/20),
since 3 • 20 = 12 • 5. So we can add (2/3) and (3/5) by
applying the definition to (4/6) and (12/20). We get the sum
((4 • 20 + 6 • 12)/(6 • 20)) = (152/120). But this is the same
answer we got before, because 152 • 15 = 19 • 120, showing
that (152/120) = (19/15).
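
The two definitions, and the check that the answer does not depend on the representative, can be sketched with pairs of integers standing for the fractions on display. The names add, mul, and equal are our own, and each pair (a, b) is assumed to have b different from zero.

```python
def add(x, y):
    """(a/b) + (c/d) = ((a*d + b*c)/(b*d)), applied to the fractions on display."""
    (a, b), (c, d) = x, y
    return (a * d + b * c, b * d)

def mul(x, y):
    """(a/b) * (c/d) = ((a*c)/(b*d)), applied to the fractions on display."""
    (a, b), (c, d) = x, y
    return (a * c, b * d)

def equal(x, y):
    """Equality test for rational numbers: a * d == c * b."""
    (a, b), (c, d) = x, y
    return a * d == c * b

first = add((2, 3), (3, 5))     # using the representatives 2/3 and 3/5
second = add((4, 6), (12, 20))  # using the representatives 4/6 and 12/20

print(first, second, equal(first, second))
print(equal(mul((2, 3), (3, 5)), mul((4, 6), (12, 20))))
```

The displayed fractions differ, but the equality test shows that they represent the same rational number.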

With addition and multiplication defined in this way,
the rational numbers form a number system, because they
obey the five laws. We prove this fact here only for the
commutative law of addition:

    (a/b) + (c/d) = ((a • d + b • c)/(b • d))

    (c/d) + (a/b) = ((c • b + d • a)/(d • b))

But the a, b, c, and d appearing in these symbols are integers,
and integers obey the commutative laws for addition and
multiplication. Therefore b • d = d • b, showing that the
two results have the same denominator; and a • d + b • c =
c • b + d • a, showing that the two results have the same
numerator. Therefore the two sums are the same and
(a/b) + (c/d) = (c/d) + (a/b). A similar proof can be given for
each of the other four laws.

Zero, Unity and Negatives 

Like the system of whole numbers and the system of
integers, the rational number system has a zero element and
a unity element. The rational number (0/1) is the zero
element, because (0/1) + (x/y) = ((0 • y + 1 • x)/(1 • y)) = (x/y).
The zero element can also be written in the form (0/b),
where b is any integer different from zero. This is proved
by the test for equality of rational numbers, because
0 • 1 = 0 • b. Where there is no danger of confusion with
the zero of the system of integers, we use the symbol 0 for
the zero element of the rational numbers, too.

The rational number (1/1) is the unity element, because
(1/1) • (x/y) = ((1 • x)/(1 • y)) = (x/y). The unity element can also be
written in the form (b/b), where b is any integer different from
zero. This is proved by the fact that 1 • b = b • 1. So
(2/2), (3/3), (4/4), and (5/5) are alternate ways of
representing the unity element. A rational number is the unity
element if it is represented by a fraction whose numerator
and denominator are equal. Where there is no danger of
confusion with the unity element of the system of integers,
we let the symbol 1 stand for the unity element of the
rational numbers, too.

The system of integers has one important property that
the system of whole numbers does not have: every number in
the system has a negative. The system of rational numbers
has this property, too. In fact, if (a/b) is any rational number,
then ((-a)/b) is its negative. To prove this fact, we must show
that their sum is the zero element:

(a/b) + ((-a)/b) = ((a • b + b • (-a))/(b • b))

The numerator of
the fraction on display for the sum is a • b + b • (-a). By
the commutative law for multiplication of integers, a • b
can be replaced by b • a, so the numerator is equal to
b • a + b • (-a). By the distributive law for integers, this
sum is equal to b • (a + (-a)). But a + (-a) = 0, so we
finally have that the numerator is equal to b • 0 = 0. Therefore
(a/b) + ((-a)/b) = (0/(b • b)), the zero element in the rational
number system.


The Reciprocal of a Rational Number 


The negative of a number was defined in terms of addition:
one number is the negative of another if the sum of the two 
numbers is zero. The analogous concept in relation to multi- 
plication is that of the reciprocal. One number is called the 
reciprocal of another if the product of the two numbers is 
the unity element. In the system of whole numbers, 1 is the 
only number that has a reciprocal. In fact, it is its own 
reciprocal, since 1 • 1 = 1. In the system of integers, there 
are only two numbers that have reciprocals. They are 1 
and —1. Each of them is its own reciprocal, because 
1 • 1 = 1, and (—1) • ( — 1) = 1. But in the rational num- 
ber system, the existence of reciprocals becomes the rule 
rather than the exception. Every rational number except the
zero element in that system has a reciprocal. In fact, if (a/b)
is any rational number that is not the zero element, then a
is not 0. Consequently, (b/a) is also a rational number, and it
is the reciprocal of (a/b). To prove that it is, we multiply
them. The product (a/b) • (b/a) = ((a • b)/(b • a)) = the unity
element, since the numerator and the denominator of the
fraction on display are equal.
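The rules for negatives and reciprocals can be checked in the same mechanical way. This is a sketch only, again modeling a fraction as a pair of integers; the function names are hypothetical and chosen just for this illustration.

```python
# A fraction is modeled as a pair (numerator, denominator) of integers.

def same_rational(f, g):
    (a, b), (c, d) = f, g
    return a * d == c * b

def add(f, g):
    (a, b), (c, d) = f, g
    return (a * d + b * c, b * d)

def mul(f, g):
    (a, b), (c, d) = f, g
    return (a * c, b * d)

def negative(f):
    """The negative of (a/b) is ((-a)/b)."""
    a, b = f
    return (-a, b)

def reciprocal(f):
    """The reciprocal of (a/b) is (b/a), provided a is not 0."""
    a, b = f
    if a == 0:
        raise ZeroDivisionError("the zero element has no reciprocal")
    return (b, a)

ZERO, UNITY = (0, 1), (1, 1)
x = (3, 4)
print(add(x, negative(x)), same_rational(add(x, negative(x)), ZERO))
print(mul(x, reciprocal(x)), same_rational(mul(x, reciprocal(x)), UNITY))
```

The sum comes out as a fraction with numerator 0, and the product as a fraction whose numerator and denominator are equal, just as the proofs predict.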

Since every rational number except the zero element has 
a reciprocal, it is convenient to introduce a special symbol 
to mean “the reciprocal of.” If A stands for any rational
number that is different from zero, we write 1/A for the
reciprocal of A. Then, by the definition of reciprocal, we know
that A • (1/A) = (1/A) • A = 1.


We undertook the construction of the system of rational
numbers for the purpose of finding a number system in
which the equation B • X = A always has a solution, as
long as B is not zero. We can now show that we have
achieved our purpose. If B and A are rational numbers, and
B is not zero, then B has a reciprocal, 1/B. If we multiply both
sides of the equation by 1/B, we get (1/B) • B • X = (1/B) • A. But
(1/B) • B = 1, so we have 1 • X = (1/B) • A. But 1 • X = X,
because of the characteristic property of a unity element.
Therefore, if there is a solution, X must be equal to (1/B) • A.
This value really is a solution, because
B • (1/B) • A = 1 • A = A. Since solving the
equation B • X = A is another way of saying divide A by
B, our result suggests a definition of division that is
appropriate for rational numbers. To divide by a rational
number means to multiply by its reciprocal. In the system
of rational numbers, division is always possible as long as
the divisor is different from 0.
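As an illustration, the solution of B • X = A can be computed by multiplying by the reciprocal. The sketch below assumes Python's standard fractions module as a stand-in for the rational number system; the function name solve is our own.

```python
from fractions import Fraction

def solve(B, A):
    """Solve B * X = A in the rational numbers by multiplying by 1/B."""
    if B == 0:
        raise ZeroDivisionError("B must be different from zero")
    return (1 / B) * A

B, A = Fraction(3, 4), Fraction(5, 6)
X = solve(B, A)
print(X, B * X == A)
```

Substituting the result back into the equation confirms that it is a solution, which is the same check carried out in the text.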


We Still Have the Integers 


By constructing the rational number system, we have
gained something, in that every number in it except 0 has
a reciprocal. At the same time, we have lost nothing,
because we still have the original system of integers hidden
within the rational number system in disguise. That is,
there is a subset of the rational number system that is
isomorphic to the system of integers, and therefore can take
its place for all practical purposes. This subset consists of
all rational numbers of the form (a/1), where a is any integer,
positive, negative, or 0. In fact, suppose we set up a mapping
which matches each integer a with the rational number (a/1).
This mapping turns out to be an isomorphism, because,
under this mapping, the image of a product is the product
of the images, and the image of a sum is the sum of the
images. The proof of this fact is seen in the following
equations obtained by merely applying the definitions of
addition and multiplication of rational numbers to the
numbers (a/1) and (b/1):

(a/1) + (b/1) = ((a • 1 + 1 • b)/(1 • 1)) = ((a + b)/1)

(a/1) • (b/1) = ((a • b)/(1 • 1)) = ((a • b)/1)

Because of this isomorphism, the rational number (a/1) may
be thought of as being the “same” as the integer a, and we
use the symbol for the integer as an abbreviated notation
for the corresponding rational number. So, from now on,
we shall write 0 instead of (0/1) and 1 instead of (1/1), as
already agreed. But we shall also write 2 instead of (2/1), -2
instead of ((-2)/1), and so on. We shall also drop the
parentheses in writing a rational number, so that, when we write the
fraction a/b, we shall mean the whole family of fractions of
which a/b is only a representative. With these conventions, we
have the familiar notation for rational numbers that is used
in everyday life. A rational number is called positive if it
can be written as a fraction with positive numerator and
denominator. It is called negative if, when it is written with
a positive denominator, its numerator is negative.
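The two properties that make the mapping an isomorphism can be spot-checked on a range of integers. The following sketch assumes Python's standard fractions module; the names image and preserves_operations are hypothetical. A check of this kind covers only the integers tried, so it illustrates the proof rather than replacing it.

```python
from fractions import Fraction

def image(a):
    """Match the integer a with the rational number (a/1)."""
    return Fraction(a, 1)

def preserves_operations(integers):
    """Check that sums map to sums and products map to products."""
    for a in integers:
        for b in integers:
            if image(a + b) != image(a) + image(b):
                return False
            if image(a * b) != image(a) * image(b):
                return False
    return True

print(preserves_operations(range(-10, 11)))
```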


The Rational Points on a Line 

On page 30, we saw how we could represent the whole 
numbers as points on a line, spaced at equal intervals on 
one side of the point called 0. This procedure assigned num- 
bers to only some of the points on the line. On page 54, we 
saw that we could represent the integers as points on a line, 
too. We equated the positive integers with the natural 
numbers already associated with points on one side of the 0. 
Then we placed the negative integers on the other side of 
the 0. In this way we assigned numbers to more of the points 
on the line. Now we can continue the process and assign 
numbers to many of the points that lie between those that 
represent the integers. 

There is a point that divides the distance between 0 and
1 into two equal parts. We call that point 1/2. There are two
points that divide the distance between 0 and 1 into three
equal parts. We call those points 1/3 and 2/3. By a similar
procedure, we can put -1/3, -1/2, and -2/3 between 0 and -1.

[Diagram: a number line marked, from left to right, with the points -1, -2/3, -1/2, -1/3, 0, 1/3, 1/2, 2/3, and 1.]


This process can be extended so that we can find a point on 
the line for every rational number, positive or negative. 
The arrangement on the line makes it possible to talk about 
larger or smaller rational numbers in the same way that we 
could talk about larger or smaller integers. Of any two 
distinct rational numbers, the one that is further to the 
right is the larger one. Or, as in the case of integers, we can 
define the meaning of “larger” in this way: If a and b are 
rational numbers, a is larger than b if a - b is positive.

The rational numbers are quite densely distributed over 
the whole line. Between any two points that represent 
rational numbers, there is at least another one that also 
represents a rational number. In fact, if a and b are two
rational numbers, their average, (a + b)/2, is also a rational
number, and lies between them. The fact that we can always
find “in between” numbers makes the rational number 
system the appropriate one to use to represent measure- 
ments where subdivisions of the unit may be needed. By 
using as many “in between” numbers as we wish, we can 
refine a scale of measurement as much as we like, and make 
measurements as precise as the physical limitations of our 
equipment and our senses will allow us to. 
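The way averages supply “in between” numbers can be sketched as follows. This is an illustration only, assuming Python's standard fractions module; between and refine are names of our own choosing.

```python
from fractions import Fraction

def between(a, b):
    """Return the average of a and b, which lies between them."""
    return (a + b) / 2

def refine(a, b, steps):
    """Refine the scale from a to b by inserting averages repeatedly."""
    points = [a, b]
    for _ in range(steps):
        refined = []
        for left, right in zip(points, points[1:]):
            refined.extend([left, between(left, right)])
        refined.append(points[-1])
        points = refined
    return points

print(between(Fraction(1, 3), Fraction(1, 2)))
print(refine(Fraction(0), Fraction(1), 2))
```

Each additional step doubles the number of subdivisions, so the scale can be made as fine as we like.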

Since the rational numbers are distributed densely over 
the line, we may guess that we now have a number assigned 
to every point on the line. But this guess turns out to be 
false, as we shall see. In fact, the lack of numbers for some 
points on the line is the next defect in the number system 
that we shall try to eliminate by expanding it once more. 


The Rational Numbers Form a Field 


The rational number system, like the system of integers, 
is an example of a group structure and a ring structure.
Addition of rational numbers is associative. There is an
identity element for addition (the zero), and each element 
has an inverse with respect to addition (the negative). So 
the rational number system meets all the requirements for 
being a group with respect to addition. In fact, since addition 
of rational numbers is commutative, it is an abelian group, 
and we are conforming to custom by using a plus sign for 
the group operation. The operation of multiplication in the 
rational number system is associative, and it is also dis- 
tributive with respect to addition. With these further proper- 
ties, the rational number system meets the requirements 
for being a ring. In fact, since multiplication is commutative, 
it is a commutative ring. Moreover, it contains a unity 
element. 

In the transition from integers to rational numbers, some- 
thing new has been added. Every rational number except 0 
has a reciprocal. But a reciprocal is simply an inverse with re- 
spect to the operation of multiplication. So the rational num- 
ber system with zero omitted meets all the requirements for 
being a group with respect to multiplication. It therefore has 
a double group structure, one for addition, and one for 
multiplication. A system of this kind, that has a double group 
structure, is called a field. A field is defined as a commutative ring in which
a unity element exists, and which has a reciprocal for every 
element except zero. The presence of these reciprocals makes 
it possible to carry out division by any element except zero. 

The way in which groups, rings, and fields differ from each 
other may be expressed briefly, though crudely, as follows: 
A group is a system in which we can perform addition and 
subtraction. A ring is a system in which we can perform 
addition, subtraction, and multiplication. A field is a system 
in which we can perform addition, subtraction, multipli- 
cation, and division, except that division by 0 is excluded. 

Finite Fields 

The rational number system is a field that contains an 
infinite number of elements. There are also fields that have 
only a finite number of elements. In fact, we have already 
encountered some in earlier sections of this book. In the last
chapter, we found that the system of residue classes of
integers modulo 3 has a ring structure. The elements in the
ring are classes called 0, 1, and 2. The multiplication table
in this ring is as follows:

     •  |  0  1  2
    ----+---------
     0  |  0  0  0
     1  |  0  1  2
     2  |  0  2  1

We see from this table that every element in the ring
except zero has a reciprocal. In fact, the reciprocal of 1 is 1,
because 1 • 1 = 1; and the reciprocal of 2 is 2, because
2 • 2 = 1. So the system of residue classes of integers
modulo 3 is a field.

If you did exercise 5 in the “Do It Yourself” section at 
the end of Chapter III, you have met another finite field. 
The residue classes of integers modulo 5 constitute a ring 
with this multiplication table:

     •  |  0  1  2  3  4
    ----+---------------
     0  |  0  0  0  0  0
     1  |  0  1  2  3  4
     2  |  0  2  4  1  3
     3  |  0  3  1  4  2
     4  |  0  4  3  2  1

This table shows that every element except zero in this
system has a reciprocal. In fact, the reciprocal of 1 is 1; the
reciprocal of 2 is 3; the reciprocal of 3 is 2; and the reciprocal
of 4 is 4.
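Reading reciprocals off a multiplication table amounts to searching each row for the entry 1. The sketch below does this search directly; the function name reciprocals is hypothetical, and each residue class is represented by its remainder.

```python
def reciprocals(n):
    """Map each nonzero residue class modulo n to its reciprocal, if it has one."""
    table = {}
    for a in range(1, n):
        for x in range(1, n):
            if (a * x) % n == 1:
                table[a] = x
                break
    return table

print(reciprocals(3))
print(reciprocals(5))
```

For the moduli 3 and 5 every nonzero class appears in the result, in agreement with the two tables.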

On the other hand, the residue classes of integers modulo
6 constitute a ring that is not a field. The multiplication
table for this system, printed on page 68, shows that some 
of its elements which are not zero do not have a reciprocal. 
In fact, there is no element whose product with 2, 3, or 4 is 
equal to 1. So the elements 2, 3, and 4 in this system do not 
have reciprocals. The reason for this failure is that the 
number 6 has positive integer divisors besides itself and the 
number 1. These divisors are 2 and 3. Any integer which is 
also divisible by 2 or 3 belongs to a residue class that cannot 
have a reciprocal. Let us prove this fact for an integer 
divisible by 2. 

Suppose that such an integer belongs to residue class a,
and we multiply by any other residue class x. We shall show
that the product a • x cannot be equal to 1, and so a cannot
have a reciprocal. To identify the product of a and x, we
follow the directions given on page 65. We pick any member
of a and any member of x, multiply them, and then identify
the residue class of the product. Identifying the residue class
of the product means finding its remainder when it is divided
by 6. As the representative of a let us use the member that
we know is in it that is divisible by 2. Since it is divisible
by 2 we may represent it as 2 • m, where m is some other
integer. Let us designate by k the member we select from
class x. Then the product of the two representatives of
their classes is 2 • m • k. Now we divide this product by
6, and obtain a quotient and a remainder, both of which are
integers. Let us call the quotient q, and the remainder r.
Then, from the fact that a dividend is equal to the divisor
times the quotient plus the remainder, we can say that
2 • m • k = 6 • q + r. Therefore r = 2 • m • k - 6 • q. By
the distributive law, r = 2 • (m • k - 3 • q). That is, the
remainder is divisible by 2. Therefore it cannot be 1. Therefore
the residue class to which 2 • m • k belongs is not class
1. Therefore the product of class a and class x cannot be
class 1, no matter what class x may be.

By means of a similar proof, it can be shown that, in
general, the ring of residue classes modulo n is not a field if
n has positive integer divisors besides itself and the number
1. An integer that has positive integer divisors other than
itself and 1 is called factorable. An integer greater than 1
that is not factorable is called prime. It can also be shown that if n is a prime
integer, the ring of residue classes modulo n is a field. The
integers 3 and 5 are both prime. That is why the ring of
residue classes modulo 3 and the ring of residue classes
modulo 5 both turned out to be fields.
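The connection between primes and fields can be observed, though of course not proved, by testing small moduli. The following sketch uses hypothetical function names and simply checks whether every nonzero residue class has a reciprocal.

```python
def is_field(n):
    """True if every nonzero residue class modulo n has a reciprocal."""
    return all(
        any((a * x) % n == 1 for x in range(1, n))
        for a in range(1, n)
    )

def is_prime(n):
    """True if n is greater than 1 and has no divisors besides itself and 1."""
    return n > 1 and all(n % d != 0 for d in range(2, n))

fields = [n for n in range(2, 20) if is_field(n)]
primes = [n for n in range(2, 20) if is_prime(n)]
print(fields)
print(fields == primes)
```

For every modulus from 2 to 19 the two lists agree, which is what the general statement leads us to expect.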

No Zero Divisors 

In the system of integers, we found that the cancellation 
law for multiplication was a consequence of the fact that the 
ring of integers has no zero divisors. Since this law is a great 
convenience in solving equations, it would be useful if it 
turned out to be true in the rational number system, too. 
Fortunately, it does, because the rational number system 
is a field, and a field cannot have zero divisors. The proof of 
this fact flows directly from the definition of zero divisors 
and the definition of a field. If a field did have zero divisors, 
there would be two elements in the field, say a and b, both 
not zero, but whose product is zero. So we could write 
a • b = 0. Since a is not zero, and the system is a field, it
has a reciprocal, 1/a. Multiplying both sides by 1/a, we get
(1/a) • a • b = (1/a) • 0. But (1/a) • a = 1, and (1/a) • 0 = 0, so we have
1 • b = 0, or b = 0, contradicting the assumption that a
and b are both not zero. Therefore it is impossible for a
field to have zero divisors.
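The contrast between a field and a ring that is not a field can be seen by listing zero divisors among residue classes. This is a small sketch; zero_divisors is a name of our own, and each class is represented by its remainder.

```python
def zero_divisors(n):
    """List the nonzero residue classes modulo n whose product with
    some nonzero class is the zero class."""
    return [
        a for a in range(1, n)
        if any((a * b) % n == 0 for b in range(1, n))
    ]

print(zero_divisors(5))
print(zero_divisors(6))
```

The field of residue classes modulo 5 has none, while the ring modulo 6 has exactly the classes 2, 3, and 4, the same classes that were found to lack reciprocals.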

Ideals in a Field 

The special properties of a field also impose some restric- 
tions on the ideals that it may contain. To get acquainted 
with these restrictions, let us first recall the definition 
of an ideal, and note some facts about the ideals of rings in 
general. We defined an ideal as a subring of a ring that has 
the property that if we multiply any member of the subring 
by any member of the ring, whether it is in the subring or 
not, the product turns out to be in the subring. For example,
the set of all even integers is an ideal in the ring of integers
because, first, it is a subring, and secondly, the product of 
an even integer and any integer is an even integer. 

Every ring contains at least two ideals. One of them is 
the subset consisting of only one element, the 0 of the ring. 
The other is the entire ring itself. 

To show that the subset that contains only the 0 element 
is an ideal, we check to see if it fits the definition of an ideal. 
Notice first that addition within this subset is associative 
and commutative because it is in the ring as a whole. 
Multiplication within the subset is associative, and is dis- 
tributive with respect to addition because it is in the ring 
as a whole. We observe next that the subset {0} contains the 
zero element, and also the negative of each of its elements, 
since 0 is its own negative. Moreover, 0 + 0 = 0, so the 
sum of elements in the subset is in the subset. Therefore the 
subset meets all the requirements for being an abelian group. 
The product 0 • 0 = 0, and so is a member of the subset.
And, since multiplication in it is associative and distributive 
with respect to addition, the subset meets all the require- 
ments for being a subring. Now we observe that any element 
in the ring times 0 gives a product equal to 0, so that this 
product is also in the subring. This last property makes the 
subring an ideal. Similarly, checking the requirements one 
by one, we see that the original ring is one of its ideals. 

The ideal that consists of the zero element alone is called 
the zero ideal. We shall also assign a special name to the 
original ring when viewed as one of its own ideals. We call 
it the unit ideal. The reason for this name is seen in the 
following considerations. Suppose the ring we are talking 
about has a unity element. (This is not true of all rings, as 
we saw on page 64, but it is true of all fields.) Let us take a 
close look at any ideal that contains the unity element 1. 
The characteristic property of an ideal is that when we 
multiply a member of the ideal by any member of the ring, 
the product is a member of the ideal. Let us, then, multiply 
1 by any element x in the ring. The product, x • 1 = x, is
therefore in the ideal. In other words, every element in the
ring is in the ideal. So, if an ideal contains the unity element,
it contains all the elements of the ring, and therefore must 
be the whole ring. That is why the ideal that consists of the 
whole ring is called the unit ideal. We have in this fact and 
in this name a test for recognizing when an ideal is the unit 
ideal. To show that an ideal is the unit ideal of a ring, it 
suffices to show that the ideal contains the element 1. 

Since a field is a ring, it has these two special ideals, the 
zero ideal and the unit ideal. Now we show that a field has 
no other ideals besides these two. Suppose we examine any 
ideal of a field. Since the ideal is a subring, it must contain 
the 0 element. If it contains no other elements, then it is the 
zero ideal. If it contains some other element, say b, then b
is different from zero, and therefore has a reciprocal 1/b in the
field. If we multiply the element b of the ideal by 1/b, the
product is a member of the ideal. But this product is 1.
Therefore the ideal contains 1, and must be the unit ideal. 
This concludes the proof. 
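For a small finite ring, all the ideals can be found by brute force. The sketch below is an illustration under our own assumptions: it works only with residue classes modulo n, and it relies on the fact that in a finite ring a subset containing 0 and closed under addition also contains the negatives of its members. The function name ideals is hypothetical.

```python
from itertools import combinations

def ideals(n):
    """List every ideal of the ring of residue classes modulo n."""
    elements = range(n)
    found = []
    for size in range(1, n + 1):
        for subset in combinations(elements, size):
            s = set(subset)
            # Closed under addition within the subset.
            closed_add = all((a + b) % n in s for a in s for b in s)
            # A member times ANY ring element stays in the subset.
            absorbs = all((r * a) % n in s for r in elements for a in s)
            if 0 in s and closed_add and absorbs:
                found.append(sorted(s))
    return found

print(ideals(5))
print(ideals(6))
```

The field of residue classes modulo 5 shows only the zero ideal and the unit ideal, while the ring modulo 6, which is not a field, has two more.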

DO IT YOURSELF 

1. Separate all integers into residue classes modulo 7, by 
putting into one class all the integers that have the same 
remainder when you divide by 7. By using the remainder 
as the name for the class associated with it, we get seven 
classes called 0, 1, 2, 3, 4, 5, and 6. Construct the multi- 
plication table for these residue classes (see page 65). 
Verify from the table that each element in this system 
except 0 has a reciprocal. What are the reciprocals of 
2, 3, 4, 5, and 6 in this system? 

2. Construct the multiplication table for residue classes 
modulo 12. Which elements are zero divisors in this 
system?

CHAPTER V


Filling Out the Line 


More Questions to Be Answered 

SO FAR we have expanded the number system three 
times, from natural numbers to whole numbers to integers, 
and then from integers to rational numbers. In each of the 
last two expansions, the purpose of the expansion was to 
have a number system in which a certain type of equation 
would always have a solution. The first type of equation we 
tried to solve was one that included only a single step of 
addition. This was the equation of the form b + x = a. The 
second type of equation we tried to solve included only a 
single step of multiplication. This was the equation of the 
form b • x = a. It was natural to consider these equations
first, because addition and multiplication are the operations 
that are built into the structure of a number system. 

It is just as natural now to go beyond these simplest
equations, and consider others that may include both
operations, or may use an operation more than once. For example,
we might examine equations like 2 • x + 5 = 11, or
3 • (x + 2) = 7, which involve both addition and
multiplication. We could also consider an equation like
3 • x • x • x - 5 • x • x = 2 • x + 9, where the operation of
multiplication is repeated several times. Such equations, in which
only addition and multiplication may be employed, are
known as algebraic equations.

For a systematic survey of these equations, we first
rewrite them in standard form. Wherever the unknown “x”
is multiplied by itself several times, we use the abbreviated
power notation, writing x^2 for x • x, x^3 for x • x • x, and so
on. We eliminate parentheses by using the distributive law.
For example, 3 • (x + 2) can be replaced by 3 • x + 6.
We reduce one side of the equation to 0 by adding negatives
where necessary. For example, in the equation x^2 - 3x =
x + 6, if we add -x - 6 to both sides of the equation, we
get x^2 - 3x - x - 6 = 0. Finally, we combine like terms,
and arrange all terms in descending powers of x, getting
x^2 - 4x - 6 = 0.

Here are some typical equations, written in standard
form, using numbers that belong to the rational number
system:

x^3 - 2 • x^2 + [...] • x - 5 = 0

The highest power of x that appears in an algebraic 
equation in standard form is called the degree of the equation. 
In the rational number system, an equation of the first degree
always has a solution. The typical first degree equation has
the form a • x + b = 0, where a and b are rational numbers,
and a is different from 0. To solve it, we take advantage of
the fact that every rational number has a negative, and
every rational number that is different from 0 has a
reciprocal. First we add to both sides of the equation the
number -b, which is the negative of b. This gives us
a • x = -b. Then we multiply both sides of the equation
by 1/a, which is the reciprocal of a, and we find that x =
(1/a) • (-b). This result tells us that if there is a solution to the
equation, it must be equal to (1/a) • (-b). We verify that it is
indeed a solution by substituting into the equation. For
example, to solve the equation (2/3) • x - 5/2 = 0, first add 5/2 to
both sides of the equation. Then multiply both sides by 3/2.
The result asserts that if there is a solution, it must be equal
to (3/2) • (5/2), or 15/4. If we substitute 15/4 for x in the equation, it
then says (2/3) • (15/4) - 5/2 = 0. Calculation shows that this
statement is true, so that 15/4 really is a solution to the equation.
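The two steps of the solution, adding the negative of b and then multiplying by the reciprocal of a, can be written out as a short routine. This is a sketch assuming Python's standard fractions module; the name solve_first_degree is our own.

```python
from fractions import Fraction

def solve_first_degree(a, b):
    """Solve a * x + b = 0 for rational a and b, with a different from 0."""
    if a == 0:
        raise ValueError("a must be different from 0")
    x = (1 / a) * (-b)       # add -b to both sides, then multiply by 1/a
    assert a * x + b == 0    # verify by substituting into the equation
    return x

# The worked example: (2/3) * x - 5/2 = 0, so a = 2/3 and b = -5/2.
print(solve_first_degree(Fraction(2, 3), Fraction(-5, 2)))
```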

However, we have less luck when we try to solve equations
of the second degree. In the rational number system, we can
solve some of them, but we cannot solve others. For example,
we can solve without any difficulty the equation x^2 - 1 = 0.
First add 1 to both sides and we get x^2 = 1. In this form,
the equation asks, “What number, when multiplied by itself,
gives 1 as the product?” The number 1 is obviously an
answer to this question, because 1 • 1 = 1. In fact, the
number -1 is also a good answer to the question, because
(-1) • (-1) = 1. So we have found two solutions to the
equation. The equation x^2 - 2 = 0 looks as though it ought
to be just as easy to solve, but it isn’t. If we add 2 to both
sides, we get x^2 = 2. In this form, the equation asks, “What
number, when multiplied by itself, gives 2 as the product?”
We may be tempted to say that the answer is √2, or the
square root of 2. But this is really not an answer to the
question. It is only a restatement of the question. When
we write the symbol √2, it signifies, “that number (if it
exists) which when multiplied by itself gives 2 as the
product.” But the question still remains, “Does it exist?”
We shall soon see that it does not exist in the rational
number system. We shall prove that there is no rational
number whose square is equal to 2.

Before giving the proof, let us approach the question in 
another way. We find that the question we are trying to 
answer is one that comes up in a simple problem in geometry. 
Suppose we construct a square whose sides are one unit long, 
and then draw the diagonal of the square. The diagonal has 
a definite length. To calculate it, we use the Pythagorean
theorem that states that the square of the hypotenuse of
a right triangle is equal to the sum of the squares of the legs.
In this case the theorem leads to the equation x^2 = 1^2 + 1^2,
or x^2 = 2, which is precisely the equation that we are trying
to solve. So, when we prove that there is no rational number 
whose square is equal to 2, we shall be proving at the same 
time that no rational number can represent the length of 
the diagonal of a square whose side has length 1. 

The proof makes use of some simple facts about rational
numbers and integers that we shall take note of first. 1)
Every rational number can be represented by a fraction that
is “reduced to lowest terms.” 2) If a fraction is reduced to
lowest terms, then its numerator and denominator cannot
both be even numbers. For, if they were, it would mean that
the fraction can be reduced further by dividing numerator
and denominator by 2. For example, 6/8 is not in lowest terms.
It can be reduced to 3/4. 3) The square of an even integer is
an even integer, and the square of an odd integer is an odd
integer. For example, 6 • 6 = 36, which is an even integer;
7 • 7 = 49, which is an odd integer. From this fact, it follows
that if the square of an integer is even, then the integer itself
is even.

We prove that there is no rational number whose square
is equal to 2 by showing that the assumption that there is
one leads to a contradiction. Suppose there is a rational
number whose square is equal to 2. Then it can be
represented by a fraction that is reduced to lowest terms. Let a/b
represent this fraction, where a and b are both integers.
Since the fraction is in lowest terms, a and b are not both even
integers. Since the square of this fraction is supposed to be
equal to 2, we can write a^2/b^2 = 2. Multiplying both sides by
b^2, we find that a^2 = 2 • b^2. This equation tells us that a^2
is double the integer b^2, so a^2 is an even integer. But if a^2 is
even, then a must be even, that is, it is double some other
integer. If we call that other integer k, then a = 2 • k. In
that case, a^2 = (2 • k) • (2 • k) = 4 • k^2. If we substitute this
expression for a^2 in the equation a^2 = 2 • b^2, we get 4 • k^2 =
2 • b^2. Dividing both sides by 2, we see that 2 • k^2 = b^2. In
other words, b^2 is double the integer k^2, or b^2 is an even
integer. But if b^2 is even, then b must be even. We began by
observing that a and b are not both even, and end up by
concluding that they are both even. We were led to this
contradiction by the assumption that there is a rational
number whose square is equal to 2. Therefore we are
compelled to reject the assumption.
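A search by machine cannot replace the proof, since it can examine only a finite number of fractions, but it does illustrate the conclusion. The sketch below looks for integers a and b with a^2 = 2 • b^2, using math.isqrt from the Python standard library (available in Python 3.8 and later); the function name is hypothetical.

```python
from math import isqrt

def square_root_of_two_candidates(max_denominator):
    """Return all pairs (a, b), with 1 <= b <= max_denominator,
    for which a * a == 2 * b * b exactly."""
    hits = []
    for b in range(1, max_denominator + 1):
        a = isqrt(2 * b * b)   # largest integer whose square is at most 2*b*b
        if a * a == 2 * b * b:
            hits.append((a, b))
    return hits

print(square_root_of_two_candidates(10000))
```

The list of hits is empty for every bound we try, as the proof guarantees it must be.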

The proof given above is a very old one. By tradition, it was
worked out about 2500 years ago by the Greek mathematician
Pythagoras or his followers. The philosophers of Greece were so
pleased to discover that there were lengths that could not 
be represented by rational numbers, that they celebrated 
the discovery, we are told, by sacrificing one hundred oxen 
to the gods. 

In Chapter IV we represented the rational numbers as 
points on a line. In this representation, wherever a positive 
number is attached to a point, the number represents the 
distance of that point from 0, if we use the distance from 0 
to 1 as the unit of length. Suppose, now, we measure out 
to the right of 0 a length equal to the length of the diagonal 
of a square whose side has length 1 (see diagram on page 
102). In this way we locate a definite point whose distance 
from 0 is equal to the length of the diagonal. But we have 
just proved that there is no rational number that can represent
this length. So we have found a point on the line that
has no rational number attached to it. This discovery 
answers the question that we raised on page 83. The rational 
number system supplies numbers for some, but not all, of 
the points on the line. Even though the rational numbers 
are spread out densely over the line, there are gaps that 
they leave unfilled. If we want to have a number to repre- 
sent every point on the line, we have to expand the number 
system again. We proceed to do so now, in order to fill the 
gaps. 


Decimal Fractions 


There are many ways of approaching the problem of filling 
the gaps between the rational numbers. We shall use here an 
approach that grows naturally out of the custom of writing 
numbers as “decimals.” Decimals are more correctly
described as decimal fractions. The decimal .2 is an abbreviated
way of writing the fraction 2/10. The decimal .23 is an
abbreviated way of writing the fraction 23/100. In each case, the
denominator is understood to be a power of 10, and we
identify the power by counting the number of digits to the
right of the decimal point. Thus, since .235 has three digits
after the decimal point, we know the denominator is 10^3 or
1000, and the decimal .235 represents the fraction 235/1000.

So we see that each decimal represents a fraction, and there- 
fore is simply another way of writing some of the numbers 
in the rational number system. This observation immediately 
suggests the question, “Can every rational number be 
written as a decimal?” We are led to a rather interesting 
problem when we try to answer this question. 

In elementary school we learned that we can convert a 
fraction into a decimal by using the process of long division. 


For example, to get the decimal equivalent of 1/4 we divide
4 into 1, using the following form:

       .25
  4 | 1.00
        8
        20
        20


The long division terminates after two steps because in the 
last subtraction the remainder is zero. The conclusion is 

that 1/4 = .25. If we try the same procedure to get a decimal
equivalent for the fraction 1/3 we run into trouble. We arrange
the work in the same way, as follows:

       .3333
  3 | 1.0000
        9
        10
         9
         10
          9
          10
           9
           1

But, no matter how many steps we carry out, the division
never comes to an end. After each subtraction there is a
remainder of 1. So the fraction 1/3 cannot be represented as a
decimal with a finite number of digits.

The persistent reappearance of the remainder 1 tempts
us to keep dividing. If we do so, we get a decimal that never
ends. If we want to represent the fraction 1/3 by a decimal at
all, it will have to be an infinite decimal, that is, one that has
an infinite number of digits after the decimal point. So, to
decide whether every rational number can be represented
as a decimal we have to investigate what meaning, if any,
an infinite decimal can have. 
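
The two long divisions above can be retraced mechanically. The following is a minimal sketch in Python, not part of the original discussion; the function name long_division and its arguments are our own choices, and the fraction is assumed to lie between 0 and 1.

```python
def long_division(numerator, denominator, max_digits):
    """Divide numerator by denominator the schoolbook way.

    Returns (digits, remainders): the digits after the decimal point and
    the remainder left after each subtraction. Stops early if a remainder
    of zero appears. Assumes 0 <= numerator < denominator.
    """
    digits, remainders = [], []
    remainder = numerator
    for _ in range(max_digits):
        remainder *= 10                      # "carry down" a zero
        digits.append(remainder // denominator)
        remainder %= denominator             # what is left after subtracting
        remainders.append(remainder)
        if remainder == 0:                   # the division terminates
            break
    return digits, remainders


print(long_division(1, 4, 10))   # stops as soon as the remainder is zero
print(long_division(1, 3, 6))    # the remainder 1 keeps coming back
```

For 1/4 the list of remainders ends in zero, while for 1/3 every remainder is 1, however large we make max_digits.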

To interpret the meaning of a finite decimal, we identify 
a numerator from the digits that we see, and a denominator 
by using the appropriate power of ten, indicated by the 
number of digits that appear after the decimal point. Then 
we put the two together in the form of a fraction. This 
method breaks down with an infinite decimal, so we have to 
try another approach. What we do is think of the infinite 
decimal as a sequence of finite decimals, made progressively 
longer by appending another digit at each step. In this view, 
the infinite decimal .333333 .... represents the infinite sequence
of finite decimals .3, .33, .333, .3333, .33333, .... To
see how this sequence is related to the fraction 1/3, let us compare
each of the numbers in the sequence with the fraction.
Suppose, for example, we subtract .3 from 1/3. Writing the
decimal as a common fraction, we say

    1/3 - 3/10 = 10/30 - 9/30 = 1/30.

This difference is rather small, and in many practical
problems is small enough to be considered negligible. So we
may use .3 as an approximation of the value of 1/3. If we subtract
.33 from 1/3 we find the difference to be 1/300. This
difference is smaller than 1/30, so .33 is a better approximation
to the value of 1/3 than .3 is. If we try each of the decimals in
the sequence in turn, we get better and better approximations
to the number 1/3. The longer the decimal is, the
better the approximation becomes, because the difference
from 1/3 gets smaller and smaller. To approximate means
literally to come close. By taking more and more digits in
the decimal we get a number that is closer and closer to 1/3.
In fact, we can get as close to 1/3 as we please by simply
taking a decimal that is long enough. We describe the situation
by saying that the sequence of decimals approaches
the number 1/3 as a limit.


We now have the means of explaining how an infinite 
decimal may represent a number. An infinite decimal repre- 
sents a number if the sequence of finite decimals obtained 
by taking more and more of its digits approaches that 
number as a limit. With this definition of the meaning of 
an infinite decimal, it may make as much sense as a finite 
decimal, and we can now answer the question, “Can every 
rational number be represented by a decimal?” The answer 
is, “Yes, by a finite or infinite decimal.” 
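
The shrinking differences can be checked with exact fractions. The sketch below is only an illustration; the helper names truncation and gap_from_one_third are hypothetical, and Python's fractions module is used so that no rounding enters the comparison.

```python
from fractions import Fraction


def truncation(n):
    """The finite decimal .33...3 with n digits, as an exact fraction."""
    return Fraction(int("3" * n), 10 ** n)


def gap_from_one_third(n):
    """How far the n-digit decimal falls short of 1/3."""
    return Fraction(1, 3) - truncation(n)


for n in range(1, 6):
    print(truncation(n), gap_from_one_third(n))
```

Each extra digit divides the gap by ten, which is what we mean by saying that the sequence approaches 1/3 as a limit.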


A Nest of Intervals 

The meaning of an infinite decimal can also be expressed 
pictorially in terms of our representation of the rational 
numbers as points on a line. It can be interpreted as a de- 
scription of the position of a number, or directions that may 
be followed in order to find it. 

Suppose, for example, we want to describe where the
number 1/3 is located. Since it is a rational number, we can
say first that it is somewhere on the line on which the rational
numbers are represented. Then we try to specify where it is 
on the line. The integers on the line divide the line into 
intervals of unit length. We can assign a name to each in- 
terval by using the number that is attached to the end that 
is nearest to 0. We call the interval between 0 and 1 the 
“0 interval”; the interval between 1 and 2 the “1 interval,” 
and so on. On the negative side of the line, we call the
interval between 0 and -1 the “-0 interval,” the interval
between -1 and -2 the “-1 interval.” Notice that 0 and
-0 refer to different intervals.


Now we can be more specific about where the number 1/3
is by saying that it is in the 0 interval. So we write down 0
as the first part of a chain of directions. The next step in the 
description is to say where it is in the 0 interval. For this 
purpose, we divide the 0 interval into ten equal parts, each 
with length .1. We label these intervals in order, starting 
with the one nearest to the 0, as 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 
0.6, 0.7, 0.8, and 0.9. Now we learn from long division that
1/3 is more than 0.3, but less than 0.4, so it lies in the interval
whose name is 0.3. The name of this interval incorporates
both parts of the description we have given so far. It tells
us that the number 1/3 is in the 0 interval, and that within
that interval it is in the sub-interval that has 0.3 as its left
end.

[Figure: Location of 1/3 in the 0 interval, with points of
division marked 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]

Now we narrow down its location once more. We divide
the sub-interval into ten equal parts, each of width .01, and
label them, in order, 0.30, 0.31, 0.32, 0.33, 0.34, 0.35, 0.36,
0.37, 0.38, and 0.39. Again we learn from long division that 1/3
is more than 0.33, but less than 0.34. So it lies in the interval
whose name is 0.33. The name of this interval is a three part
description. It says that the number 1/3 is in the 0 interval,
and inside this interval it is in the 0.3 interval, and inside
the latter interval it is in the 0.33 interval. Now we subdivide
this interval, and continue the process indefinitely.

[Figure: Location of 1/3 in the 0.3 interval enlarged, with points
of division marked 0.30, 0.31, 0.32, 0.33, 0.34, 0.35, 0.36, 0.37,
0.38, 0.39, 0.40]


Each time we pick out the sub-interval that contains the 
point. According to this scheme, the infinite decimal 
0.33333 .... represents an infinite sequence of intervals, 
with the following characteristics: each successive interval 
lies inside the interval that precedes it in the sequence, and 
as we move along the sequence, the width of the intervals 
shrinks toward 0. We call such a sequence of intervals a nest 
of intervals. 

In this instance, we chose each interval in the nest as an
interval that contains the number 1/3. So we are sure that the
number 1/3 lies inside every one of the infinite number of
intervals in the nest. We can also be sure that no other
number can be in there with it, because two different 
numbers cannot lie in the same nest. This is so, because two 
different numbers are separated by a distance that is greater 
than 0. Since the inner intervals in a nest shrink in width 
toward a width of 0, ultimately the innermost intervals are 
too narrow to span the distance between two separate 
points, and cannot enclose them both. For this reason, the 
nest of intervals represented by the infinite decimal
0.333 .... describes the number 1/3 and no other number. A
nest of intervals represents a number if that number is attached
to the only point that lies inside all the intervals of the nest.
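
The chain of directions 0, 0.3, 0.33, .... can be generated step by step. Here is a minimal sketch, assuming a non-negative rational number supplied as an exact fraction; the name decimal_nest is ours, and when the number falls on a point of division the sketch simply takes the interval that has the number as its left end.

```python
from fractions import Fraction
from math import floor


def decimal_nest(x, depth):
    """Return the first depth + 1 intervals of the decimal nest around x.

    Each interval is a pair (left, right) of exact fractions; interval n
    has width 1/10**n. Assumes x >= 0.
    """
    nest = []
    for n in range(depth + 1):
        scale = 10 ** n
        left = Fraction(floor(x * scale), scale)   # name of the interval
        nest.append((left, left + Fraction(1, scale)))
    return nest


for left, right in decimal_nest(Fraction(1, 3), 3):
    print(left, right, right - left)
```

Every interval printed contains 1/3, lies inside the one before it, and is one tenth as wide.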


A Nest for Every Rational Number 

By following the same procedure with any rational 
number, we can find a nest of intervals that contains the
number, and the nest of intervals can be represented by an
infinite decimal. In this way we can get an infinite decimal 
for every rational number. This is true even for numbers 
that are represented by finite decimals. In fact, in these 
cases, the number can be represented by two infinite deci- 
mals. Suppose, for example, we look for a nest of intervals 
and the associated infinite decimal to represent the number
2.3. First we notice that the number lies in the 2 interval
which extends from 2 to 3. Now, when we subdivide this
interval into ten equal parts, one of the points of division
is actually 2.3. As a point of division, it belongs to two
intervals. It is the right end of the interval whose name is
2.2. It is also the left end of the interval whose name is
2.3. So we may choose either one as the interval to which it
belongs. After we have made that choice, we shall not be
free to choose again. If we think of the number as belonging
to 2.2, then forever after, as we subdivide the intervals, the
number will always lie in the last subdivision. The infinite
decimal in this case comes out 2.2999999 .... with an endless
series of nines. If we think of the number as belonging to
2.3, then forever after, as we subdivide the intervals, the
number will always lie in the first subdivision. Then the 
infinite decimal comes out 2.3000000 .... with an endless 
series of zeros. We get two names like this for every number 
that can be represented as a finite decimal, because a finite 
decimal turns up, sooner or later, as an endpoint of an in- 
terval in the nest. The two different ways of representing 
the number signify that such a number can be approached 
from two directions, from the left or from the right. 
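
The two names for 2.3 can be compared numerically. In the sketch below, which is an illustration only, the helper finite_decimal is a hypothetical name; it builds the exact value of a finite decimal from its digits, so that we can watch 2.29, 2.299, .... close in on 2.3 from the left while 2.30, 2.300, .... sit on it from the right.

```python
from fractions import Fraction


def finite_decimal(whole, digits):
    """Exact value of the finite decimal whole.d1d2...dn."""
    value = Fraction(whole)
    for place, digit in enumerate(digits, start=1):
        value += Fraction(digit, 10 ** place)
    return value


target = Fraction(23, 10)
for n in range(1, 6):
    from_left = finite_decimal(2, [2] + [9] * n)     # 2.29, 2.299, ...
    from_right = finite_decimal(2, [3] + [0] * n)    # 2.30, 2.300, ...
    print(target - from_left, from_right - target)
```

The first difference shrinks by a factor of ten at every step and the second is always zero, so both sequences approach 2.3 as a limit.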

A Nest That Has No Number 

We have answered in the affirmative the question, “Can 
every rational number be represented by a decimal?” We 
have found that it can be represented by an infinite decimal, 
and in some cases by two infinite decimals. In each case the 
infinite decimal represents a nest of intervals, and the 
rational number is attached to the only point that is inside 
the nest. 

Now let us reverse the question and ask, “Does every
infinite decimal represent a rational number?” Or, to ask
the question in another form, “Does every nest of intervals 
defined by an infinite decimal have a rational number 
attached to the point inside the nest?” We can see im- 
mediately that the answer to this question is, “No.” We 
found before that there are some points on the line that have 
no rational number attached. The point whose distance 
from 0 is equal to the length of the diagonal of a unit square 
is such a point. Its location is pointed out by the arrow in 
the diagram. However, we can follow the same procedure 
with this point as we did with the point that represents the
fraction 1/3. We can locate it precisely by means of a nest of
intervals, and the associated infinite decimal. We observe
first from the diagram that the point lies between 1 and 2 
so the decimal begins with the number 1. Then we sub- 
divide the interval between 1 and 2, and observe that the 
point lies between 1.4 and 1.5. We can verify this
by arithmetic, by observing that 1.4 × 1.4 = 1.96, a
number that is less than 2, while 1.5 × 1.5 = 2.25, a number
that is more than 2. So, 1.4 is too small to be the length of 
the diagonal of the unit square, and 1.5 is too large. By con- 
tinuing the process of subdivision, we get a nest of intervals 
and an infinite decimal that represents it. There is one and 
only one point inside this nest, the point whose distance 
from 0 is the length of the diagonal of the unit square. But 
there is no rational number attached to this point. So the 
infinite decimal that belongs to this nest does not represent 
a rational number.


The Real Number System

We can think of an infinite decimal as a question. It asks, 
“If you make the nest of intervals that my digits describe, 
what number will you find inside the nest?” In the rational 
number system this question does not always have an 
answer. So, once again, we ask, “Is there a larger number 
system in which such a question does always have an 
answer?” To construct such a number system, we use the 
same device we have already used twice before. We make the 
question its own answer. We build a new number system 
in which each element is an infinite decimal. We call this 
system the system of real numbers. In general, each separate 
infinite decimal is a separate number. But, we know from 
our experience with the rational number 2.3 that we shall 
have to provide for some exceptions. We must specify that 
whenever a decimal ends with an infinite chain of nines, as 
2.99999 . . . does, the decimal obtained by replacing the 
chain of nines by a chain of zeros and adding 1 to the last 
digit before the chain will represent the same number. Thus 
2.300000 .... and 2.299999 .... will represent the same 
number. Under this agreement, every real number is either 
a single infinite decimal, or a pair of infinite decimals either 
one of which may be used to represent the pair. 

When we represented the rational numbers as points on 
a line, we found that there were gaps on the line, where no 
numbers were attached. Later we found that every point on 
the line can be located by an infinite decimal, and now we 
are converting every infinite decimal into a number. So, at 
one stroke, we are filling all the gaps on the line. The real 
number system gives us a number for every point on the 
line. 
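
The gap left by the diagonal of the unit square shows how an infinite decimal is found for a point that has no rational number. As a sketch, the squaring test used earlier (1.4 × 1.4 is less than 2, 1.5 × 1.5 is more than 2) can be repeated at each subdivision using whole numbers only. The function name diagonal_nest is our own, and each left end is returned as a whole number k that stands for the decimal k/10^n.

```python
def diagonal_nest(depth):
    """Left ends of the decimal nest around the diagonal of the unit square.

    The n-th entry is the largest whole number k for which the square of
    k/10**n is still less than 2, so 14 stands for 1.4, 141 for 1.41, etc.
    """
    left_ends = []
    k = 1                                    # 1*1 < 2 < 2*2: the decimal begins with 1
    for n in range(1, depth + 1):
        k *= 10                              # subdivide into ten equal parts
        while (k + 1) ** 2 < 2 * 100 ** n:   # move right while the next end is too small
            k += 1
        left_ends.append(k)
    return left_ends


print(diagonal_nest(4))
```

The process never stops of its own accord, because no finite decimal has a square exactly equal to 2; the sketch simply quits after the requested number of subdivisions.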

So far we have merely defined a collection of elements, 
each of which is an infinite decimal or a pair of decimals. 
To convert this collection into a number system we have to 
give it the structure that every number system must have. 
We have to define operations of addition and multiplication 
on these elements, and we must show that the operations 
obey the five laws. To define the operations, we lean on the
smaller number system we already have as a crutch. Just
as we used the whole numbers when we built the structure 
of the system of integers, and we used the integers when we 
built the structure of the system of rational numbers, we 
now use the rational numbers when we build the structure 
of the system of real numbers. We define addition and multi- 
plication of real numbers as follows: 

To add two infinite decimals, break up each decimal into 
a sequence of finite decimals, by using in succession, more 
and more of the digits in the decimal. These finite decimals 
represent rational numbers, and may be added by the rules 
for rational numbers. So, first add the first numbers in each 
sequence. The sum is a first approximation to your answer. 
Then add the second numbers in each sequence. This gives 
you a better approximation to your answer. As you proceed 
with the sequence of additions, you get a longer and longer 
finite decimal. The digits at the beginning of the decimal 
may fluctuate at first, but then settle down so that ultimately 
a fixed digit is defined for each decimal place. In this way a 
specific infinite decimal is obtained as the sum. To multiply 
infinite decimals, multiply the successive pairs of finite 
decimals in the same way. 

For example, to add 0.2222222 . . . . and 0.8888888 . . . .,
we proceed by these steps:

   0.2     0.22     0.222     0.2222     0.22222     0.222222
  +0.8    +0.88    +0.888    +0.8888    +0.88888    +0.888888
  ----    -----    ------    -------    --------    ---------
   1.0     1.10     1.110     1.1110     1.11110     1.111110

The sum is clearly the infinite decimal 1.111111 . . . . in 
which all the digits are ones. To multiply these same 
decimals, we proceed by these steps:

     0.2       0.22        0.222         0.2222
   x 0.8     x 0.88      x 0.888       x 0.8888
   -----     ------     --------     ----------
    0.16        176         1776          17776
               176         1776          17776
             ------       1776          17776
             0.1936     --------       17776
                        0.197136     ----------
                                     0.19749136


We see that the product begins with 0.197, and more of the 
digits are fixed when more of the steps are taken. 
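
The same steps can be carried further by machine. The following sketch assumes nothing beyond the rule just stated: cut both infinite decimals off after n digits, then add or multiply the finite decimals. The helper name repeated_digit is hypothetical, and Python's decimal module is used because it keeps finite decimals exact at this length.

```python
from decimal import Decimal


def repeated_digit(digit, n):
    """The finite decimal 0.dd...d with n digits after the point."""
    return Decimal("0." + str(digit) * n)


for n in range(1, 7):
    a, b = repeated_digit(2, n), repeated_digit(8, n)
    print(a + b, a * b)      # n-th approximation to the sum and to the product
```

Reading down the two columns of output, we can watch the leading digits of the sum and of the product settle down one after another.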

At each step in the addition or multiplication just defined, 
we are using rational numbers. The rational numbers obey 
the five laws. As a result, the five laws are extended to the 
real numbers as well. For example, 0.2 + 0.8 and 0.8 + 0.2 
lead to the same sum; 0.22 + 0.88 and 0.88 + 0.22 lead 
to the same sum; etc. So 0.222 . . . + 0.888 . . . and 0.888 
. . . + 0.222 . . . lead to the same infinite decimal as sum. 
In other words, real numbers obey the commutative law of 
addition. The other four laws can be established by a similar 
argument. 

We Still Have the Rational Numbers 

We constructed the real number system in such a way that 
we have a number for every point on the number line. 
Among the points on the line are those to which we had 
previously assigned rational numbers. Let us refer to these 
as the rational points on the line. The real numbers assigned 
to the rational points form a subset of the real number 
system that is isomorphic to the rational number system. 
For all practical purposes, they are the “same” as the 
rational numbers. The symbol for a rational number is a 
fraction or a finite decimal. Such a symbol is easier to write 
and work with than an infinite decimal, so we use the rational 
number rather than the infinite decimal to represent the 
real number attached to a rational point. But first we must 
learn to recognize which of the real numbers belong to the 
rational points, and are therefore entitled to this simpler 
representation. 

The infinite decimals that represent rational points are 
those which, after a finite number of digits in the decimal, 
simply repeat a fixed block of one or more digits over and 
over again. For example, the following decimals all represent
rational numbers: 0.3333 . . ., in which the block 3 repeats;
0.121212 . . ., in which the block 12 repeats; and
2.37454545 . . ., in which the block 45 repeats after 2.37.
To prove this fact, we have to show, first, that every rational
number, represented by a fraction, can be written as a repeating
decimal. Then, vice versa, we must show that every
repeating decimal can be written as a fraction. 

To show that a fraction can be written as a repeating 
decimal, recall that we can convert a fraction into a decimal 
by long division, dividing the denominator into the numer- 
ator. In the long division process there is a step involving 
subtraction, after which we carry down a digit from the 
dividend. The dividend is an integer with a finite number of 
digits. So these are soon exhausted. Then we begin carrying 
down the zeros that appear after the decimal point. Con- 
sider what happens after we reach this stage of carrying 
down only zeros. In the subtraction step there is a remainder 
that is less than the divisor, and this remainder determines 
what the next number in the quotient will be. Since the re- 
mainders must be less than the divisor, the list of possible 
remainders is a restricted finite list. But, as we proceed with 
the division, we get an endless succession of remainders. So 
we cannot keep getting a different remainder each time. 
Sooner or later, a remainder that turned up before is re- 
peated, and the division process begins to repeat itself. 
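
The argument about remainders can be turned into a procedure that finds the repeating block. This is a minimal sketch under our own naming (repeating_decimal is not from the text); it assumes a non-negative numerator and a positive denominator, and it records the position at which each remainder first appears.

```python
def repeating_decimal(numerator, denominator):
    """Split numerator/denominator into (whole, prefix, block).

    prefix holds the digits before the repetition starts and block the
    digits that repeat forever. A terminating decimal comes out with the
    block [0].
    """
    whole, remainder = divmod(numerator, denominator)
    digits = []
    seen = {}                          # remainder -> position where it first turned up
    while remainder not in seen:       # must stop: fewer than `denominator` remainders exist
        seen[remainder] = len(digits)
        remainder *= 10                # carry down a zero
        digits.append(remainder // denominator)
        remainder %= denominator
    start = seen[remainder]            # the division repeats itself from here on
    return whole, digits[:start], digits[start:]


print(repeating_decimal(1, 3))
print(repeating_decimal(1, 4))
print(repeating_decimal(448, 165))
```

Because every remainder is less than the divisor, the loop cannot run for more steps than the size of the divisor before some remainder turns up a second time.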

To show that every repeating decimal can be written as 
a fraction, we shall work out one specific example showing 
how it is done. It will be clear that the method used can be 
employed with any repeating decimal whatever. 

Suppose we find the fraction that represents the repeating 
decimal 2.7151515 .... We first split this decimal into two 
parts, 2.7, and .0151515 . . . ., separating the non-repeating
part from the repeating part. The first part is the fraction
27/10. Now we find a fraction for the second part. First multiply it
by ten, so that the repeating block will begin right after the
decimal point. Let us call the result x, and remember that
it is ten times as big as the number we are looking for, so
after we find x we must divide by ten. x = .151515 . . . .
Now multiply both sides of this equation by 100. This has
the effect of moving the decimal point two places to the right.
We get 100x = 15.1515 . . . which may also be written as
100x = 15 + .1515 . . . . But the decimal in this equation is
none other than x all over again. So we may write 100x =
15 + x. Taking x away from both sides, we find that
99x = 15. Dividing by 99 on both sides, we find that x = 15/99
or 5/33. Now we divide by ten, to find that the second part
of our original number is 5/330. So the infinite decimal
2.7151515 . . . is the sum of 27/10 and 5/330. Therefore 2.7151515
. . . . represents the rational number 896/330 or 448/165. To check
the result, divide 165 into 448.
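
The method of the example can be written out for any repeating decimal. The sketch below is an illustration under stated assumptions: the name fraction_from_repeating is ours, the whole part is non-negative, and the non-repeating digits and the repeating block are supplied as strings of digits, the block being non-empty.

```python
from fractions import Fraction


def fraction_from_repeating(whole, prefix, block):
    """Fraction for the decimal whole.prefix block block block ...

    For example (2, "7", "15") stands for 2.7151515...
    """
    # Non-repeating part: 2.7 is 27/10.
    head = Fraction(int(str(whole) + prefix), 10 ** len(prefix))
    # Let x = 0.block block ... Multiplying by 10**len(block) gives
    # 10**len(block) * x = block + x, so x = block / (10**len(block) - 1).
    x = Fraction(int(block), 10 ** len(block) - 1)
    # x was shifted left past the prefix, so shift it back before adding.
    return head + x / 10 ** len(prefix)


print(fraction_from_repeating(2, "7", "15"))
```

For the decimal 2.7151515 . . . . the steps are 27/10, then x = 15/99, then 27/10 + x/10, which agrees with the fraction found by hand.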


We Still Have a Field 

The rational number system has the structure of a field. 
Enlarging the number system has not destroyed this struc- 
ture, because the real number system, too, is a field. We 
can verify that it has the characteristics of a field, one by 
one. In the first place it is a commutative group with the 
operation of addition. The zero element in the group is the 
infinite decimal 0.0000 . . . ., which we may write briefly as 
0, and every infinite decimal has a negative, namely, the in- 
finite decimal written with the same digits in the same order, 
but having the opposite sign attached. For example, the
negative of .333 . . . is -.333 . . . . It is also a ring, because
the multiplication is distributive with respect to addition.
The unity element may be written in two ways: 1.00000 . . . .,
or 0.9999 . . . . Moreover, every element except 0
has a reciprocal, so the system is a field. To find the recipro- 
cal of an infinite decimal, we can use the successive finite 
decimals that approximate it, and divide each into the num- 
ber 1. The quotients we get serve as successive approxi- 
mations of the reciprocal, and, one by one, we can identify 
the digits in the decimal that represents it. In special cases 
we have simplified ways of doing it. For example, to find the
reciprocal of √2, we write it first in fraction form as 1/√2.
The value is unchanged if we multiply by √2/√2, because this
multiplier is equal to 1, or the unity element. But then we
have √2/2, whose decimal equivalent is easily found by
dividing 2 into the decimal for √2. The decimal for √2
begins as 1.414 . . ., so the decimal for its reciprocal begins
as .707 . . . .
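
Both routes to the reciprocal can be compared on a few finite decimals for √2. This is a sketch only: the name reciprocal_estimates is hypothetical, and the digits 1.4, 1.414, 1.41421 are taken as given rather than computed here.

```python
from fractions import Fraction


def reciprocal_estimates(approximation):
    """Two estimates of the reciprocal of the square root of 2.

    approximation is a finite decimal for the root, as an exact fraction.
    The first estimate divides it into 1 (the general method); the second
    divides it by 2 (the shortcut).
    """
    return 1 / approximation, approximation / 2


for digits, places in [(14, 1), (1414, 3), (141421, 5)]:
    direct, shortcut = reciprocal_estimates(Fraction(digits, 10 ** places))
    print(float(direct), float(shortcut))
```

The two estimates squeeze together as more digits of √2 are used, and both begin with .707 once enough digits are taken.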


A Number in Every Nest 

The system of real numbers has some special properties 
that the rational number system does not have. The most 
convenient way of expressing these properties is in terms of 
the picture we have set up of numbers as points on a line. 
When we represented the rational numbers as points on a 
line, we found that there were gaps that it left unfilled. That 
is, there were points on the line that had no numbers at- 
tached to them. The real number system was deliberately 
designed to eliminate this defect. In this system, not only 
do we have a point for every number. We also have a number 
for every point. There is a one-to-one correspondence be- 
tween the real number system and the points on the number 
line. Because of this correspondence, we may think of the 
real numbers as the points on the line, and can describe the 
properties of the real number system in terms of relation- 
ships of the points on the line. 

For example, by making every infinite decimal an element 
of the real number system, we assured the fact that there 
would be a real number for every such decimal. This property 
can also be described in terms of points on the line, as we 
have seen : an infinite decimal represents a nest of intervals 
on the line, and for every such nest, there is one and only 
one point that lies inside every interval of the nest. In this 
statement, the nest referred to is a nest associated with an 
infinite decimal. The successive intervals in such a nest
have a special length, viz., 1, .1, .01, .001, etc., and their
endpoints are always finite decimals. However, it is possible 
to form nests of intervals of a more general character by 
removing these restrictions on the lengths of the intervals 
and the locations of their endpoints. The only requirements 
for calling a sequence of intervals a nest are that the suc- 
cessive intervals lie one inside the other and that the lengths 
of the inner intervals shrink toward 0. It can be proved that 
in the real number system, all nests have the same property 
we have found for nests associated with infinite decimals: 
There is one and only one point that lies inside every interval
of the nest. In this sense, every nest of intervals defines
a single real number. Some of the other properties of the 
real number system that we shall now examine are closely 
related to this fact. 

Infinite Series 

Addition is an operation defined on a pair of numbers, so, 
initially, we can add only two numbers at a time. However, 
by performing one addition after another, we can extend the 
operation to include any finite number of numbers. For example,
there is no difficulty about finding the sum of these
numbers: 1 + 1 + 1 + 1 + 1 + 1. The sum is, of course, 6.
However, if we permit the series of ones to go on indefinitely,
in the infinite series 1 + 1 + 1 + 1 + 1 + 1 + . . . ., then we
run into trouble. The step by step addition that we can carry 
out with a finite number of terms doesn’t work here, because 
it never comes to an end. We are left then with the question: 
Does it make any sense at all to try to add an infinite series 
of terms? The answer turns out to be that sometimes it 
does, and sometimes it doesn’t. We can get clues to when an 
infinite series has meaning as a sum by re-examining an 
infinite series with which we are already familiar, namely, 
an infinite decimal. 

The infinite decimal .3333333 . . . . is really an infinite
series in disguise. In fact, we may think of it as merely an
abbreviated way of writing the infinite series .3 + .03 +
.003 + .0003 + .00003 + . . . . On page 97 we also interpreted
the infinite decimal as a sequence of finite decimals, .3,
.33, .333, .3333, . . . . These finite decimals are the sums
we get when we add a finite number of terms in the infinite
series, using the first term alone, then the first two terms,
then the first three, and so on. We call these sums the partial
sums of the series. We found that these partial sums come
closer and closer in value to the fraction 1/3, approaching this
fraction as a limit. That is why the fraction 1/3 is the number
represented by the infinite
decimal. In a similar way, we can assign a meaning to some
other examples of infinite series. If the partial sums of a series 
come closer and closer, in the long run, to some definite num- 
ber, approaching this number as a limit, then we can assign 
this limiting number as the sum of the infinite series. 

For example, let us consider the series 1 + 1/2 + 1/4 + 1/8 +
1/16 + . . . in which each term is half the size of the term that
it follows. The partial sums are 1, 1 1/2, 1 3/4, 1 7/8, 1 15/16,
1 31/32, . . . . In this case, the partial sums come closer and
closer to the number 2. The first sum differs from 2 by 1. The
second sum differs from 2 by 1/2. The third sum differs from 2
by 1/4. Successive partial sums differ from 2 by smaller and smaller
amounts, and the difference can be made as small as we
please if we add up enough terms in the series. So the partial
sums approach the number 2 as a limit. We therefore can
assign the number 2 as the sum of the series 1 + 1/2 + 1/4 +
1/8 + 1/16 + . . . .

Let us try to use the same procedure with the series 1 +
1 + 1 + 1 + . . . . The partial sums for this series form
the sequence 1, 2, 3, 4, 5, 6, . . . . In this case, there is no

number that the partial sums approach as a limit, because 
they keep increasing without limit. That is, we can get a 
partial sum to be larger than any number we wish by simply 
adding enough terms of the series. Because the partial sums 
do not approach a limit, we cannot assign any number as 
the sum of this series. So we see that not every infinite series 
has meaning as a sum. An infinite series has meaning as a 
sum only if the partial sums of the series approach a limit.
In that case we call the series a convergent series, and the
sum of the series is the limit approached by the partial sums. 
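
The contrast between the two series shows up at once in their partial sums. The sketch below is an illustration, with partial_sums as our own name for a helper that forms running totals of the first few terms; exact fractions are used so that the differences from 2 can be read off directly.

```python
from fractions import Fraction
from itertools import accumulate


def partial_sums(terms):
    """Running totals of a finite list of terms."""
    return list(accumulate(terms))


halving_terms = [Fraction(1, 2 ** n) for n in range(6)]   # 1, 1/2, 1/4, ...
for s in partial_sums(halving_terms):
    print(s, 2 - s)               # the partial sum and its distance from 2

print(partial_sums([1] * 6))      # the series 1 + 1 + 1 + ... for comparison
```

In the first series the distance from 2 is halved at every step; in the second the partial sums grow past any number we care to name, so there is no limit to assign as a sum.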

Now that we know that only some infinite series have 
meaning as a sum, the next logical question to ask is, “Which 
ones?” How do we recognize an infinite series that converges? 
We find that we can recognize some very easily by the fact 
that they define a nest of intervals. For example, consider
the series 1 - 1/2 + 1/3 - 1/4 + 1/5 - 1/6 + . . . ., in which the
successive terms decrease toward 0 and the signs alternate
between + and -. As we form the partial sums, let us locate
them as points on the line of real numbers. The first partial
sum is the number 1. Because the next term, -1/2, is negative,
the second partial sum is to the left of the first. Adding the
third term, 1/3, brings us back to the right, but not all the way
back, because 1/3 is smaller than 1/2. As we add in more and
more terms, the point representing the partial sum oscillates
back and forth, left and right, but never again as far to the
left as before, and never again as far to the right. Now consider
the intervals that have successive partial sums as their
endpoints. The first interval is bounded on the right by 1
and on the left by 1 - 1/2. The second interval is bounded on
the left by 1 - 1/2, and on the right by 1 - 1/2 + 1/3. The third
interval is bounded on the right by 1 - 1/2 + 1/3, and on the
left by 1 - 1/2 + 1/3 - 1/4. Each new interval is inside the one
that it follows. Moreover, the width of the intervals is
shrinking toward 0. The intervals form a nest, and, as we
know, there is one and only one point inside the nest. The
partial sums are crowding in toward this point as a limit,
so the series 1 - 1/2 + 1/3 - 1/4 + . . . is a convergent series.

A similar argument applies to any series in which the 
terms decrease toward zero, and the signs are alternately + 
and — . Every such series defines a nest of intervals, and 
converges to the single point that is inside the nest. This 
result, which we have established for the real number system, 
does not hold for the rational number system, because there 
it is not true that every nest of intervals has a point inside. 
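
The nest defined by an alternating series can be listed explicitly. The following sketch takes the series 1 - 1/2 + 1/3 - 1/4 + . . . as its example; the name alternating_nest is hypothetical, and each interval is formed from two successive partial sums, whichever of them lies to the left being written first.

```python
from fractions import Fraction
from itertools import accumulate


def alternating_nest(count):
    """First `count` intervals whose ends are successive partial sums
    of 1 - 1/2 + 1/3 - 1/4 + ...

    Returns a list of (left, right) pairs of exact fractions.
    """
    terms = [Fraction((-1) ** (n + 1), n) for n in range(1, count + 2)]
    sums = list(accumulate(terms))
    return [(min(a, b), max(a, b)) for a, b in zip(sums, sums[1:])]


for left, right in alternating_nest(5):
    print(left, right, right - left)
```

The width of each interval is the size of the term that was just added, so the widths shrink toward 0 exactly as the terms do.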

Another type of series that is easy to analyze is one in 
which all terms are positive, and the partial sums are 
bounded, that is, they are always less than some fixed 
number. The series l + | + ? + | + ...isan example of 
this type, because all partial sums are less than 2. Let us 
represent the general series of this type by a_1 + a_2 +
a_3 + ..., where the subscripts 1, 2, 3, etc. are labels to
identify the position of each term in the series. The partial
sums are a_1, a_1 + a_2, a_1 + a_2 + a_3, .... Since we get each
partial sum from the one that precedes it by adding a positive
number, the partial sums form an increasing sequence. Let
us represent the partial sums by the symbols S_1, S_2, S_3, S_4, ....
That is, S_1 = a_1, S_2 = a_1 + a_2, S_3 = a_1 + a_2 + a_3, and so on.

If we represent the partial sums as points on the line of 
real numbers, since the numbers are increasing we get a 
succession of points moving gradually to the right. S_2 is to the
right of S_1, S_3 is to the right of S_2, and so on. However, since
the sums are all less than some fixed number, they cannot 
move too far to the right. If the fixed number is represented 
by K on the number line, its position is an upper boundary 
beyond which the sequence of partial sums cannot go. Now 
we show how we can use K and the partial sums to define a 
nest of intervals. The points S_1 and K are the ends of an
interval. This is the first interval of the nest. All the S points
after S_1 lie to the right of S_1 and to the left of K, so they are
inside this interval. Let us divide this interval in half. The 
points in the sequence of S’s are moving to the right. Either 
they finally enter the second half of the interval, or they do 
not. If they do enter the second half, once they enter they 
stay there, because they keep moving to the right, and never
get past K. In that case we use this second half of the interval
as the second interval of the nest. If the S points never
enter the second half, that means they always remain in the 
first half. Then we use the first half as the second interval 
of the nest. Now we repeat the process. We divide the 
second interval of the nest in half, and pick one of the halves 
as the third interval of the nest. We choose the second half 
if the S’s ultimately enter it. We choose the first half if they 
never enter the second half. 

In this way we get a sequence of intervals with the char- 
acteristics of a nest: the intervals are one inside the other, 
and the width of the intervals is shrinking toward zero. Then 
there is a single real number that lies inside all the intervals 
of the nest. Since we know from the way in which we chose 
the intervals of this nest that the partial sums enter and 
remain in each of them, then the partial sums converge 
toward this single number as a limit. Therefore an infinite 
series of positive terms whose partial sums are bounded con- 
verges to a limit. This result can also be restated in terms of 
the sequence of partial sums alone, without reference to the 
series from which the sums were derived: Every increasing 
sequence of numbers that is bounded on the right converges 
to a limit. 
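
The halving procedure can be tried out on a particular series. The sketch below is an illustration under stated assumptions, not a proof. It uses the series 1 + 1/2 + 1/4 + ... and the bound K = 3, both chosen by us. Since a program can examine only a finite number of partial sums, the question "do the S points ever enter the second half?" is answered here by looking at the first sixty sums. That is enough for this example, but it is not a general test.

```python
from fractions import Fraction

def partial_sums(terms):
    """Running totals S_1, S_2, ... of a finite list of terms."""
    sums, total = [], Fraction(0)
    for t in terms:
        total += t
        sums.append(total)
    return sums

def bisection_nest(sums, bound, steps):
    """Build nested intervals by repeated halving, starting from [S_1, bound].

    At each step keep the second half if some partial sum reaches it,
    otherwise keep the first half.
    """
    left, right = sums[0], Fraction(bound)
    nest = [(left, right)]
    for _ in range(steps):
        mid = (left + right) / 2
        if any(s >= mid for s in sums):   # the S points enter the second half
            left = mid
        else:                             # they remain in the first half
            right = mid
        nest.append((left, right))
    return nest

terms = [Fraction(1, 2**k) for k in range(60)]   # 1 + 1/2 + 1/4 + ...
nest = bisection_nest(partial_sums(terms), bound=3, steps=10)
for left, right in nest:
    print(float(left), float(right))
```

The first step discards the half from 2 to 3, because the partial sums never reach 2. Every later step keeps the right-hand half, and the intervals close in on the number 2.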

The two types of convergent series just examined are only 
special cases. It is not difficult, however, to find a criterion by 
which all convergent series can be recognized. Suppose the 
series is represented by a_1 + a_2 + a_3 + ..., where the terms
may be either positive or negative. Whenever we form a 
partial sum, we are breaking the series up into two parts. 
The first part consists of a finite number of terms at the 
beginning of the series, taken in order, and added to get 
the partial sum. Let us call this part the head end of the 
series. The second part consists of all the remaining terms, 
that are not used to form this partial sum. Let us call this 
part the tail end of the series. We can designate the successive
partial sums, formed from the head end as more and
more terms are included in it, by S_1, S_2, S_3, etc. Let us call
the corresponding tail ends T_1, T_2, T_3, etc. Then

S_1 = a_1, with tail end T_1: a_2 + a_3 + a_4 + ...

S_2 = a_1 + a_2, with tail end T_2: a_3 + a_4 + a_5 + ...

S_3 = a_1 + a_2 + a_3, with tail end T_3: a_4 + a_5 + a_6 + ...

As we move along the sequence of partial sums, S_1, S_2, etc.,
we take one term at a time from the tail end and transfer it
to the head end to be included in the partial sum. As a result,
as n increases, the value of the partial sum, S_n, keeps
changing. If the last term that is transferred is positive the 
change is an increase. If the term is negative, the change is 
a decrease. If we disregard the sign of the term we get a 
positive number that tells us the size of the change without 
regard to whether it is a decrease or increase. This positive 
number is called the absolute value of the change. Thus, 
an increase by 1/2 or a decrease by 1/2 both have an absolute
value of 1/2.

The series converges if the partial sums approach a 
limit. If the partial sums approach a limit, they become more 
and more nearly equal to that limit. This means that, as 
more and more terms are included in the partial sum, the 
sum changes less and less. And if n is taken large enough, the 
partial sum S_n is so close to the limit that it changes very
little no matter how many more terms are transferred from
the tail end of the series to the head end. In fact, if n is large
enough we can keep the absolute value of this change in
the partial sum as small as we please. The partial sum S_n is
equal to a_1 + a_2 + ... + a_n. The tail end T_n consists of the
series a_{n+1} + a_{n+2} + a_{n+3} + .... If we transfer the first p
terms from the tail end to the head end, we add to S_n the
sum a_{n+1} + a_{n+2} + ... + a_{n+p}. So, if the series converges,
the absolute value of this sum must shrink toward zero as n 
increases, no matter how many terms from the tail end it 
includes. The converse is also true. If the absolute value of 
the sum of the first p terms of the tail end of the series 
shrinks to zero as n increases, no matter how large p is, then 
the series converges. This fact can be proved by showing 
that, under these conditions, there is a nest of intervals into 
which the partial sums crowd, so that they converge on the
single point that is inside the nest. This criterion for a convergent
series is known as the Cauchy criterion, and may be
summed up somewhat carelessly in these words: in the real 
number system, an infinite series converges to a limit if and 
only if it has a shrinking tail end. 
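
To see a shrinking tail end in numbers, we can compute the sum of the first p terms of the tail for several values of n. The sketch below is a rough illustration. The two sample series and the helper names are our own choices, and a computer can try only finitely many values of p, so the phrase "no matter how large p is" is merely suggested by the output, not established.

```python
from fractions import Fraction

def largest_block(term, n, max_p):
    """Largest |a_(n+1) + ... + a_(n+p)| over p = 1, 2, ..., max_p.

    `term(k)` returns the k-th term a_k of the series (k starts at 1).
    """
    total, worst = Fraction(0), Fraction(0)
    for p in range(1, max_p + 1):
        total += term(n + p)
        worst = max(worst, abs(total))
    return worst

def geometric(k):
    return Fraction(1, 2 ** (k - 1))       # 1, 1/2, 1/4, ...

def alternating(k):
    return Fraction((-1) ** (k + 1), k)    # 1, -1/2, 1/3, ...

for n in (1, 5, 10, 20):
    print(n,
          float(largest_block(geometric, n, 200)),
          float(largest_block(alternating, n, 200)))
```

For both series the printed values fall toward zero as n grows, which is what the criterion demands.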

Limit Points and Neighborhoods 

So far we have passed through four stages in the enlarge- 
ment of our number system. We started with the system of 
natural numbers. Then by successive extensions, we ob- 
tained the integers, the rational numbers, and the real 
numbers. This sequence of extensions, besides showing us 
how our notion of number has been evolving, also served to 
introduce us to a variety of mathematical structures. In the 
natural number system, we first encountered the structure 
which we have labeled a “number system” and which is dis- 
tinguished by its obedience to the five laws. In the system of 
integers we found an example of both a group and a ring. The 
system of rational numbers was our first example of a field. 
Now, in the system of real numbers, we shall get acquainted 
for the first time with another type of structure, a topo- 
logical space. A real number is an infinite decimal, and an 
infinite decimal is an infinite sequence of finite decimals. 
This fact compelled us to look into the question of when an 
infinite sequence converges to a limit. Now, as we examine 
more closely the notion of a limit, it will lead us to the 
concept of a topological space. 

Suppose we look at the sequence of numbers, 1, 1/2, 1/3, ...,
in which the nth number is 1/n. As n increases, these numbers
become smaller and smaller and approach 0 as a limit. If
we represent the numbers in the sequence as points on a line, 
the fact that they approach 0 as a limit shows up in the fact 
that the points crowd in toward the zero point. This crowd- 
ing can be described without any reference to the order in 
which the numbers are arranged in the sequence. In fact, 
let us discard the sequence and merely think of the numbers 
as the set of points on the line. Let us give this particular
set the name A. To define in what sense the points in the
set A crowd in toward the zero point, we first introduce the 
notion of a neighborhood of a point. 

A neighborhood of a point consists of all the points that 
surround it and are close to it. To make this concept more 
precise, we have to indicate what we mean by close. So we 
specify degrees of closeness, and each such specification 
defines a particular neighborhood. For example, we define 
a neighborhood of zero when we say it consists of all the 
points whose distance from zero is less than 1. This neighbor- 
hood includes all points that are larger than (to the right of) 
−1 but less than (to the left of) +1. In the diagram, we
see this neighborhood as the interval between −1 and +1,
not including its endpoints. We represent this neighborhood
by the symbol (−1, 1), in which its boundary points are
shown. We define another neighborhood of zero when we
specify that it consists of all points whose distance from zero
is less than 1/2. This is a smaller neighborhood than the first
one, and is included within it. All of its points lie between
−1/2 and +1/2. We can represent it by the symbol (−1/2, 1/2).

[Figure: Neighborhoods of zero. The intervals (−1, 1) and (−1/2, 1/2) marked on the number line.]

By picking any positive distance d, we can define a neighborhood
of zero as the set of points lying between −d and
+d, and we represent it by the symbol (−d, d). So we see
that the zero point is surrounded by a multitude of neighbor- 
hoods of many sizes. 

Now we can say exactly what is meant by the fact that
the points in the set A (A = {1, 1/2, 1/3, ...}) crowd
in toward zero: If we pick any neighborhood of zero, no
matter how small, all but a finite number of these points lie
within that neighborhood. If we choose smaller and smaller
neighborhoods, we can exclude some of the points of the set 
A from the neighborhood. But, no matter how many we 
exclude, all but a finite number of the points are still inside. 
This is the meaning of the fact that 0 is the limit point of 
the set A. 
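
The phrase "all but a finite number" can be made concrete. For the neighborhood (−d, d), the point 1/n is left outside exactly when 1/n is at least d, that is, when n is no larger than 1/d. The sketch below lists the excluded points. The function name is our own, and the set A is taken to be {1, 1/2, 1/3, ...}.

```python
from fractions import Fraction
from math import floor

def points_outside(d):
    """Members 1/n of the set A that are NOT in the neighborhood (-d, d)."""
    d = Fraction(d)
    # 1/n lies outside (-d, d) exactly when 1/n >= d, i.e. when n <= 1/d.
    return [Fraction(1, n) for n in range(1, floor(1 / d) + 1)]

print(len(points_outside(Fraction(1, 10))))     # 10
print(len(points_outside(Fraction(3, 100))))    # 33
print(len(points_outside(Fraction(1, 1000))))   # 1000
```

However small d is made, the list of excluded points is finite, while infinitely many points of A remain inside.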

Now let us examine a set in which we find crowding of a 
different kind. Let B stand for the set 

{1 + 1/2, 1 + 1/3, 2 − 1/3, 1 + 1/4, 2 − 1/4, ..., 1 + 1/n, 2 − 1/n, ...}.

In this set, we find the points crowding around two points of 
the line, 1, and 2. Here we cannot say that every neighbor- 
hood of the point 1 includes all but a finite number of points 
in the set. In fact, while the neighborhood (1/2, 1 1/2) includes
an infinite number of points of the set B, it also excludes an 
infinite number, because there are multitudes of them 
clustering around the point 2. To distinguish this case from 
the preceding case, we call the points 1 and 2 cluster points of
the set B. A point is a cluster point of a set of points if every 
neighborhood of the point includes an infinite number of 
points of the set. As we see, a set may have more than one 
cluster point. If a set is confined within a finite interval, 
and has only one cluster point, then that single cluster point 
is the limit point of the set. 
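
A count makes the difference between the two kinds of crowding visible. The sketch below assumes that B consists of the numbers 1 + 1/n and 2 − 1/n for n = 2, 3, 4, ..., and it works with a finite piece of B, since a program cannot hold the whole set. The function names are our own.

```python
from fractions import Fraction

def set_b(max_n):
    """The members 1 + 1/n and 2 - 1/n of B for n = 2, ..., max_n."""
    points = set()
    for n in range(2, max_n + 1):
        points.add(1 + Fraction(1, n))
        points.add(2 - Fraction(1, n))
    return points

def count_inside(points, center, d):
    """How many points lie in the neighborhood (center - d, center + d)."""
    return sum(1 for p in points if abs(p - center) < d)

for max_n in (10, 100, 1000):
    b = set_b(max_n)
    inside = count_inside(b, 1, Fraction(1, 2))
    print(max_n, inside, len(b) - inside)
```

As more of B is taken, the number of points inside the neighborhood of 1 grows without end, and so does the number left outside. So 1 is a cluster point of B, but not a limit point in the sense used for the set A.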

As a contrast to the behavior of a limit point or a cluster 
point, let us examine the point −1 in relation to the set A.
Surround the point −1 by the neighborhood that extends
from −1 1/2 to −1/2. Within this neighborhood, represented by
the symbol (−1 1/2, −1/2), there are no points of the set A.
That is, the neighborhood (−1 1/2, −1/2) excludes all points of
the set A. On the other hand, there is no neighborhood of 
the point 0 that excludes all points of the set A. Similarly, 
there is no neighborhood of the point 1 that excludes all 
points of the set B, and there is no neighborhood of the 
point 2 that excludes all points of the set B. Because they 
have this property, a limit point and a cluster point are 
both examples of what we call a point of adherence of a set.
A point is called a point of adherence of a set of points if
no neighborhood of the point excludes all points of the set.
A point need not be a limit point or a cluster point of a set
in order to be a point of adherence of the set. For example,
the point 1 1/2 is not a cluster point of the set B, but it is a
point of adherence. No neighborhood of 1 1/2 can exclude all
points of the set B because the point 1 1/2 is itself a member
of the set. The points of adherence of a set include all 
members of the set as well as cluster points that are not in 
the set. They are the points that cling to a set by either 
being in it or being crowded by it. 

Closed Sets and Open Sets 

With the help of the concept of point of adherence, we 
now define two special kinds of sets of points on the line. A 
set that contains all of its points of adherence is called a 
closed set. The set A, discussed in the paragraphs above, is 
not a closed set, because the point 0 is a point of adherence 
of the set but does not belong to it. However, if we enlarge 
the set by including 0 as a member, then the enlarged set 
is closed. Similarly, the set B is not closed. But the enlarged 
set formed by uniting B with the set {1, 2} is closed. An- 
other example of a closed set is the set of points between 0 
and 1, including the points 0 and 1. Such a set is called a 
closed interval. The set which contains all the points on the 
line of real numbers is also a closed set. We can count the 
empty set as a closed set, too. It certainly includes all its 
points of adherence, because there aren’t any. 

If we delete from the real number line all the members of 
some closed set, what is left is called an open set. Using the 
terminology defined in Chapter II, we can say that an open 
set is the complement of a closed set. To see what an open 
set is like, we have to think of it in relation to the closed 
set which is its complement. Suppose S is an open set, and 
C is the closed set which is its complement. Any point of the 
open set S is not in the closed set C. Therefore any point in 
the open set S is not a point of adherence of C. (If it were, 
it would have to be in C, by the definition of a closed set.)
But if a point is not a point of adherence of C, then some
neighborhood of the point contains no points of C. This 
neighborhood, which contains no points of C, must then be 
part of the open set S, which consists of all the points that 
are not in C. So we have discovered that every point of an 
open set is surrounded by an entire neighborhood that is 
also in the open set. This is a distinguishing feature of open 
sets on a line. 

A neighborhood is itself an example of an open set. For ex- 
ample, the set of points between 0 and 1, not including 0 and 
1, is an open set. It is called an open interval to distinguish
it from the closed interval which does include both end 
points. Another example of an open set is the set formed by 
uniting into one set all points in any collection of open 
intervals. The whole line of real numbers is also an open 
set, because it is the complement of the empty set, which is 
a closed set. The empty set, too, is an open set, because it 
is the complement of the whole line, which is a closed set. 
We see then that the empty set and the whole line are both 
open and closed. 

There are some sets that are neither open nor closed. For 
example, the set of points between 0 and 1, including the 
endpoint 0, but not including the endpoint 1, is neither open 
nor closed. It is not closed, because 1 is a point of adherence 
of the set, but doesn’t belong to it. It is not open, because 
0 is a point of adherence of the complement of the set, but 
does not belong to the complement, so the complement is 
not closed. 

The collection of all open sets on the line has the following 
properties, some of which we have already noted: 

1) The whole line and the empty set are both open sets.

2) The union of any number of open sets is also an open set. 

3) The set of points common to two open sets (the inter- 
section of the two sets) is an open set. 

Neighbors Make a Neighborhood 

The significance of the concept of an open set is that it 
permits a generalization of the concept of neighborhood.
We have already seen that a neighborhood is an open set.
Let us agree to extend the word neighborhood to include 
every open set. The effect of this generalization is to separate 
the notion of neighborhood from the idea of distance. Then 
a neighborhood becomes simply a collection of neighbors, 
with the characteristics of an open set. From this point of 
view, the whole line may be viewed as a system of inter- 
locking neighborhoods or open sets. The interlocking neigh- 
borhoods on the line determine what is called its topological 
structure, and make it a topological space. They fix a pattern 
of relationships within the space just as the interlocking 
stitches in a sweater fix the pattern of the sweater. 

In general, any set of objects is called a topological space 
if a collection of its subsets is singled out so that the collection
has the three properties we found in the open sets
on the line: 1) The whole space and the empty set belong 
to the collection; 2) The union of any number of sets in the 
collection is also in the collection; 3) The intersection of 
any two sets in the collection is also in the collection. When 
these three conditions are satisfied, the sets in the collection 
are called the “open sets” of the “space.” 

Under this definition, any collection of objects can be 
converted into a topological space, usually in more than one 
way. For example, let us consider the set {x, 2, #, *}. Its
elements were chosen arbitrarily, so they have no relation- 
ship to each other beyond the fact that they happen to have 
been thrown together in the same set. The set is a loose 
aggregation, and has no structure. However, the set acquires 
a structure, and the elements become related to each other 
as neighbors, as soon as we single out certain subsets that 
will be called neighborhoods or open sets. 

For example, we might specify that these four sets should
constitute the collection of open sets: {x, 2, #, *}, {x, 2},
{#, *}, and { }. This collection satisfies the three requirements
listed above. 1) The whole space, {x, 2, #, *}, and the
empty set, { }, are members of the collection. 2) The union
of any number of sets in the collection is also in the collection.
For example, the union of {x, 2} and {#, *} is {x, 2, #, *},
which is in the collection. 3) The intersection of any two
sets in the collection is also in the collection. For example,
the intersection of {x, 2} and {#, *} is the empty set { },
which is in the collection. Since the requirements are met, 
this collection of “open sets” defines a topological structure 
for the set {x, 2, #, *} and converts it into a topological 
space. The way the topological structure relates the ele- 
ments of the space to each other as members of neighbor- 
hoods is indicated in the diagram below, where each open 
set is represented by a loop enclosing its members. 
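
For a set with only a few elements, the three requirements can be checked mechanically. The sketch below is one way to do it. The function name is our own. Because the collection is finite, it is enough to test unions of two sets at a time: a union of any number of sets can then be built up one pair at a time.

```python
from itertools import combinations

def is_topology(space, open_sets):
    """Check the three requirements for a finite collection of subsets."""
    space = frozenset(space)
    opens = {frozenset(s) for s in open_sets}
    # 1) the whole space and the empty set belong to the collection
    if space not in opens or frozenset() not in opens:
        return False
    for a, b in combinations(opens, 2):
        # 2) the union and 3) the intersection of two sets must also belong
        if a | b not in opens or a & b not in opens:
            return False
    return True

space = {"x", "2", "#", "*"}
first = [{"x", "2", "#", "*"}, {"x", "2"}, {"#", "*"}, set()]

print(is_topology(space, first))                   # True
print(is_topology(space, first + [{"x", "#"}]))    # False
```

The second call fails because adding {x, #} by itself leaves out sets such as the union of {x, #} and {x, 2}.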



We can give the same set another, different topological 
structure by picking out other subsets to use as “open sets.” 
We might, for example, decide that we want the collection 
of open sets to include all those listed above, and in addition 
the set {x, #}. The inclusion of this one additional set among
the open sets compels us to include more sets, in order to
meet the three requirements. The union of {x, #} and {x, 2}
is {x, 2, #}. To meet requirement 2), we must include it
among the open sets. The intersection of {x, #} and {x, 2}
is the set {x}. To meet requirement 3), we must classify it
as an open set. For similar reasons, we have to include
{x, #, *} and {#} among the open sets. We find that all three
requirements are met now by the enlarged collection consisting
of these nine sets:

{x, 2, #, *}, {x, 2}, {#, *}, { },

{x, #}, {x, 2, #}, {x, #, *}, {x}, {#}.

The enlarged collection of open sets gives the space a more 
complicated structure of interlocking neighborhoods, as 
shown in the diagram on page 122. 
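
The step-by-step enlargement described above can be left to a program: keep adding unions and intersections until nothing new appears. The sketch below is our own illustration of that idea for a small finite set. The function name is an assumption of the sketch.

```python
def generate_topology(space, seed_sets):
    """Smallest collection that contains the seed sets, the whole space and
    the empty set, and is closed under unions and intersections of two sets."""
    opens = {frozenset(space), frozenset()} | {frozenset(s) for s in seed_sets}
    changed = True
    while changed:
        changed = False
        for a in list(opens):
            for b in list(opens):
                for c in (a | b, a & b):
                    if c not in opens:
                        opens.add(c)      # a set we are compelled to include
                        changed = True
    return opens

space = {"x", "2", "#", "*"}
seeds = [{"x", "2"}, {"#", "*"}, {"x", "#"}]
enlarged = generate_topology(space, seeds)

print(len(enlarged))   # 9
for s in sorted(enlarged, key=lambda s: (len(s), sorted(s))):
    print(sorted(s))
```

Starting from the original open sets together with {x, #}, the program arrives at the same nine sets found by hand.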

A third topological structure can be defined for the same
set by specifying that every subset shall belong to the collection
of open sets. This definition meets the three requirements,
because the whole set and the empty set are both
subsets; the union of any number of subsets is a subset; and 
the intersection of any two subsets is a subset. Under this 
definition, the set of four elements becomes a topological 
space containing sixteen open sets, including the empty set. 
The three topological structures we have defined convert 
the same set of four elements into three different topological 
spaces. They are different as spaces because, in each of the 
topological structures, the elements hang together differently 
as members of interlocking neighborhoods. 
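
The count of sixteen is easy to confirm by listing every subset. The sketch below does so and also verifies that the collection of all subsets is closed under union and intersection. The helper name is our own.

```python
from itertools import chain, combinations

def all_subsets(space):
    """Every subset of a finite set, each one as a frozenset."""
    items = list(space)
    return {frozenset(c)
            for c in chain.from_iterable(combinations(items, r)
                                         for r in range(len(items) + 1))}

discrete = all_subsets({"x", "2", "#", "*"})
closed_under_both = all(a | b in discrete and a & b in discrete
                        for a in discrete for b in discrete)

print(len(discrete))         # 16
print(closed_under_both)     # True
```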

Here is another example of a topological space. Consider 
all the straight lines that can be drawn in a plane parallel 
to some fixed direction. We can convert this set of lines into 
a topological space, in which each of the lines is a “point”
of the space, by defining for the space a structure of inter- 
locking open sets. Let an open set consist of a set of lines be- 
tween any two lines, or a union of any number of such sets. 
In this space, a set of lines between any two lines is analo- 
gous to what we called an open interval in the real number 
system, namely, a set of points between any two points. 

A more familiar looking example of a topological space 
is the circumference of a circle, in which arcs play the same 
role that intervals do on a straight line. On a circle, there 
are two arcs joining any two of its points. The interior
(excluding the endpoints) of either arc may serve as an “open”
arc just as the interior of a segment serves as an open interval
on a straight line. An open set on the circle can then
be defined as the union of any number of such open arcs.
With this definition, the circle is furnished with a topological 
structure that meets the three requirements listed on page 
120. This topological structure on a circle can also be derived 
from the topological structure on a straight line in which 
each open set is a union of open intervals. Take a segment 
of a straight line, and loop it to make a circle by joining the 
two ends together. Then open intervals on the straight line 
become open arcs on the circle. 

By isolating the topological structure of the real number 
system for separate study, mathematicians have obtained a 
deeper insight into the characteristics of that system. They 
could separate from each other those properties that the 
system has by virtue of the fact that it is a topological space, 
those that it has by virtue of the fact that it is a field, and 
those that it has because it happens to be both. The separate 
study of topological structures also develops a body of 
knowledge that is applicable to other topological spaces, 
no matter what the elements of these spaces may be. 

Rubber-Sheet Geometry 

Now that we have introduced the notion of a topological 
space, it is relevant to ask, “When are two topological 
spaces essentially the same?” The problem is analogous to 
the problem we encountered when we were talking about 
groups, rings, or fields, and has a similar answer. We said 
that two fields are essentially the same, or are isomorphic, 
if there is a one-to-one correspondence between them that 
preserves the field structure embodied in the operations of 
addition and multiplication. Similarly, we say that two 
topological spaces are essentially the same, or are homeo- 
morphic, if there is a one-to-one correspondence between 
them that preserves the topological structure embodied in 
the system of interlocking open sets. That is, two topological 
spaces are homeomorphic if, under a reversible mapping 
which establishes a one-to-one correspondence between 
them, an open set in either space has as its image an open 
set in the other. For example, imagine the circumference of
a circle being stretched like a rubber band, into a distorted
shape. Each point of the circle may take up a new position. 
Distances between points change, but the interlocking 
system of open sets remains. As a result the distorted curve 
obtained is homeomorphic to the original circle. Because 
the topological structure of a space is not changed when it 
is stretched without tearing, the study of topological struc- 
ture has sometimes been referred to as “rubber-sheet 
geometry.” 
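
For finite spaces, the definition can be tested directly: apply the one-to-one correspondence to every open set of the first space and see whether the images are exactly the open sets of the second. The sketch below is an illustration only. The second space, with elements a, b, c, d, and the two mappings are hypothetical examples of our own.

```python
def is_homeomorphism(mapping, opens_a, opens_b):
    """True if the one-to-one mapping carries the open sets of the first
    space exactly onto the open sets of the second."""
    if len(set(mapping.values())) != len(mapping):
        return False                       # the mapping is not one-to-one
    image = {frozenset(mapping[p] for p in s) for s in opens_a}
    return image == {frozenset(s) for s in opens_b}

opens_a = [{"x", "2", "#", "*"}, {"x", "2"}, {"#", "*"}, set()]
opens_b = [{"a", "b", "c", "d"}, {"a", "b"}, {"c", "d"}, set()]

good = {"x": "a", "2": "b", "#": "c", "*": "d"}
bad = {"x": "a", "2": "c", "#": "b", "*": "d"}

print(is_homeomorphism(good, opens_a, opens_b))   # True
print(is_homeomorphism(bad, opens_a, opens_b))    # False
```

The second mapping is one-to-one, but it sends the open set {x, 2} to {a, c}, which is not open in the other space, so it does not preserve the structure.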

DO IT YOURSELF 

1. By an argument similar to the one used on page 94, prove 
that there is no rational number equal to the square root 
of 3. 

2. Find the repeating decimal that represents the rational 
number 

3. What rational number is represented by the repeating
decimal .181818...?

4. Define a topological structure for the set of five elements 
{a, e, i, o, u} by designating some but not all of the subsets
as “open” sets. (See example on page 120.) Be sure
your open sets meet the requirements for a topological 
structure listed on page 120. Which sets are closed sets 
(complements of open sets) in this structure?

CHAPTER VI


Spilling Into the Plane 


IN THE system of rational numbers we were able
to solve the equation x^2 − 1 = 0, but we were unable to
solve the equation x^2 − 2 = 0. We remedied this defect by
constructing a larger number system, the system of real
numbers, in which the equation x^2 − 2 = 0 does have a
solution. In fact, it has two solutions in the real number
system, viz., +√2 and −√2. But the real number system
has some defects of its own. For example, although the
equation x^2 + 1 = 0 does not look any more complicated
than the other two equations mentioned above, it has no
solutions in the real number system. We can see why this
is so, by restating the question that the equation asks us.
First we add −1 to both sides of the equation, and we get
the equivalent equation x^2 = −1. This equation says,
“Find a number which, when multiplied by itself, gives
−1 as the product.” The real number 0 does not meet the
requirement, because 0 multiplied by itself gives 0 as the
product. A positive real number cannot meet the requirement,
because a positive number multiplied by itself gives
a positive number as the product. A negative real number
cannot answer the question either, because a negative number
multiplied by itself also gives a positive product. For
example (−1) · (−1) = +1. So there is no real number
that can satisfy the equation x^2 + 1 = 0, or x^2 = −1. Our
next goal is to construct an expanded number system in 
which this equation does have a solution. 
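
The three cases just examined can be compressed into a single line of symbols. This is only a restatement of the argument in the text, written in LaTeX notation:

```latex
x^2 \ge 0 \ \text{for every real } x
\quad\Longrightarrow\quad
x^2 + 1 \ge 1 > 0
\quad\Longrightarrow\quad
x^2 + 1 \ne 0 .
```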

At each stage in the expansion of the number system so 
far we have represented numbers pictorially as points on a 
line. With the construction of the real number system, we
finally achieved our purpose of having a number for every
point on the line. Another possible goal for the next step 
in the expansion of the number system would be to spill 
over into the plane, by constructing a number system that 
gives us a number for every point in the plane. It turns out 
that these two goals coincide. We shall reach the goal of 
finding a solution to the equation x^2 + 1 = 0 by building a
number system that supplies a number for every point in 
the plane. 

Number Pairs and Arrows 

The construction of a number system that has a number 
for every point in the plane will be carried out in two in- 
stallments. We shall do only half of the job in this chapter, 
by creating an appropriate system of elements, and defining 
an addition operation on these elements. We shall finish 
the job in Chapter VIII when we define a multiplication 
operation for these elements. The elements we shall use are
ordered pairs of real numbers, like (0, 1), (−2, 3 1/2), or
(1, √2). We assign such a pair to each point in a plane by
the familiar method used in elementary algebra to iden- 
tify the co-ordinates of a point on a graph. First we draw in 
the plane a straight line called the x axis. We furnish it with 
a scale on which measurements can be made by assigning a 
real number to each point on the line, in the manner de- 
scribed in the preceding chapters. Then we draw another 
line passing through the 0 of the x axis, and crossing the 
axis at right angles. This new line is called the y axis. The 
point where the axes cross is called the origin. We put a 
scale on the y axis, too, using the origin as its zero point, 
and putting the positive numbers on the upper half of 
the axis. 

Now we assign a pair of numbers to each point in the 
plane in the following way. Drop a perpendicular from a 
given point to the x axis. We can reach the given point 
from the origin by moving toward it in two steps. First 
move from the origin to the foot of the perpendicular. The 
distance moved supplies the first number of the ordered
pair. It is a positive number if the motion is to the right of
the origin. It is a negative number if the motion is to the 
left of the origin. Now, from the foot of the perpendicular, 
move along the perpendicular to the given point. The 
distance moved in this second step supplies the second 
number of the ordered pair. It is a positive number if the 
motion is up from the x axis. It is a negative number if
the motion is down from the x axis. The pair of numbers
is written inside parentheses. The first number in the pair is 
called its x component. The second number is called its y 
component. The ordered pairs that belong to some points 
in the plane are shown in the diagram below. Notice that 
the ordered pair (0, 0) belongs to the origin. For all points 
on the x axis, the y component is zero. For all points on the 
y axis, the x component is zero. We are going to convert
the system of ordered pairs of real numbers into a number
system in its own right by giving it the necessary structure. 

The point to which an ordered pair is assigned may be 
thought of as a picture of the pair. We can also associate an- 
other kind of picture with each pair in the following way. 
Draw an arrow from the origin to the point, with the arrow- 
head pointing away from the origin. We may think of the 
arrow as the second picture of the ordered pair. Each
ordered pair, except (0, 0), has an arrow of its own. The
arrow has a definite length, and a definite direction. We can
even assign an arrow to the pair (0, 0) by giving it an arrow
whose length is zero. However, we cannot assign any
direction to this zero arrow.

[Figure: arrows drawn from the origin to labeled points, including (−2, 3) and (−3, −5).]

Addition of Arrows 

The representation of ordered pairs as arrows is very 
helpful, because it guides us to an appropriate definition 
of addition for our system of ordered pairs. There are many 
practical situations where we encounter just such arrows,
and where there is a natural kind of addition that takes
place. For example, in physics, a force can be represented 
as an arrow. The length of the arrow indicates the strength 
of the force, and the direction of the arrow indicates the 
direction of the force. If two forces act on a body at the 
same point, the effect is the same as though the body were 
acted on by a single force called the sum or resultant of the 
two forces. One way to find this sum or resultant is to draw 
the two forces as arrows at the origin, and then complete the 
parallelogram that has the two arrows as sides. The diag- 
onal of the parallelogram that can be drawn from the origin 
is the resultant of the two forces. 

[Diagram: two force arrows drawn from the origin O, with the parallelogram completed on them.]

However, there is another, simpler way of getting the re- 
sultant, too. Each force, considered as an arrow, has an x 
component and a y component. To add the two forces, all 
we have to do is add their x components separately, and add 
their y components separately. For example, the forces 
being added in the diagram above can be represented as 
arrows, or as the ordered pairs (4, 1) and (1, 2). The x
components are 4 and 1, and the y components are 1 and 2. 
Adding them separately, we find that the resultant or sum 
of the forces belongs to the ordered pair (5, 3). This ex- 
ample suggests how we should define addition of ordered 
pairs, if we want them to be useful for the solution of 
practical problems. We therefore give the following definition
of addition in the new system: To add two ordered
pairs, add their components separately. In symbols, this 
definition says, (a, b) + (c, d) = (a + c, b + d). Because 
the sum of two ordered pairs is also an ordered pair, this 
addition is a binary operation. 

There are several facts that we can observe about this 
binary operation immediately. First, it is associative. That 
is, [(a, b) + (c, d)] + (e, f) = (a, b) + [(c, d) + (e, f)].
This follows from the fact that we carry out the addition by 
adding the components, which are real numbers, and addi- 
tion of real numbers is associative. For example, 

[(2, 3) + (3, 1)] + (4, 8) = (5, 4) + (4, 8) = (9, 12). 

But (2, 3) + [(3, 1) + (4, 8)] = (2, 3) + (7, 9) = (9, 12),
too. Secondly, the addition of ordered pairs is commutative.
That is, (a, b) + (c, d) = (c, d) + (a, b). This follows from
the fact that addition of real numbers is commutative. For 
example, (2, 3) + (3, 1) and (3, 1) + (2, 3) both yield the 
same sum, (5, 4). Third, the system has a zero element, 
because (0, 0) + (a, b) = (0 + a, 0 + b) = (a, b). Finally, 
for each element in the system, there is a negative in the 
system, too, with the usual property that a negative is sup- 
posed to have. That is, the sum of any ordered pair and its 
negative is equal to the zero element. The negative of any 
ordered pair (a, b) is the ordered pair (-a, -b). For example,
the negative of (2, 3) is (-2, -3) because (2, 3) +
(-2, -3) = (0, 0). With these four characteristics, the
system of number pairs we have constructed meets all the 
requirements for being an abelian group, with the group 
operation denoted by the plus sign. 
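
The four properties just listed can be tried out on a computer. What follows is only a minimal sketch in Python, not something taken from the text: the names add, negative, and ZERO are chosen here purely for illustration, and checking a few sample pairs illustrates the laws without proving them.

```python
from typing import Tuple

Pair = Tuple[float, float]

ZERO: Pair = (0, 0)  # the zero element of the system


def add(p: Pair, q: Pair) -> Pair:
    """Add two ordered pairs by adding their components separately."""
    return (p[0] + q[0], p[1] + q[1])


def negative(p: Pair) -> Pair:
    """Return the pair whose sum with p is the zero element."""
    return (-p[0], -p[1])


# The two forces from the parallelogram example.
print(add((4, 1), (1, 2)))  # (5, 3)

# Associativity, using the pairs from the worked example.
left = add(add((2, 3), (3, 1)), (4, 8))
right = add((2, 3), add((3, 1), (4, 8)))
print(left == right)  # True

# Commutativity, zero element, and negative.
print(add((2, 3), (3, 1)) == add((3, 1), (2, 3)))  # True
print(add(ZERO, (2, 3)))                           # (2, 3)
print(add((2, 3), negative((2, 3))))               # (0, 0)
```

Because each component is handled by ordinary addition of real numbers, every property of the pairs is inherited directly from the corresponding property of the reals.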

With this observation, we complete the first half of the 
job of converting these elements into a number system. We 
shall go on with the second half of the job in Chapter VIII. 
Meanwhile we take a detour along a road pointed out by 
our system of arrows. As we follow this road we shall en- 
counter some more mathematical structures that play an 
important part in modern mathematics. 

Stretching the Arrows

In the diagram below we see an arrow representing the 
ordered pair (2, 1). If the arrow is stretched so that its 
length is doubled while its direction remains unchanged, 
we get a new arrow (the shaded arrow in the diagram). 
If the arrow is contracted to half its original length we also 
get an arrow pointing in the same direction. In the first 
case, the length of the arrow was multiplied by a factor of 
2. In the second case it was multiplied by a factor of 1/2.
Because of this fact, it is natural to think of the operation
of stretching or shrinking an arrow as a kind of multiplication
of the arrow by a real number. To see how we may
define the operation in terms of ordered pairs, notice that
when the arrow (2, 1) was doubled, each of its components 
was doubled, giving the arrow (4, 2) as the result. So we 
define a special kind of multiplication for our system of 
ordered pairs as follows. To multiply an ordered pair of
real numbers by a real number, multiply each of its components
separately by that real number. In symbols, the
definition says a • (b, c) = (a • b, a • c). It is important
to notice that this kind of multiplication differs in one 
important respect from all the other multiplications defined 
so far in this book. In each of the earlier cases, the multi- 
plication was an operation on two elements drawn from 
the same system. In this case, the two elements being mul- 
tiplied are drawn from two different systems. One of the 
multipliers is drawn from the field of real numbers. The
other multiplier is drawn from the abelian group consisting 
of ordered pairs of real numbers. In order to sharpen the 
distinction between these two systems, we introduce some 
special names for them. We call the elements of the real 
number field scalars. The ordered pairs, or their pictorial 
representations as points in the plane or as arrows, will be 
called vectors. The kind of multiplication in which a vector 
is multiplied by a scalar to form a product which is also a 
vector is called scalar multiplication. We can verify from 
our definitions that scalar multiplication is distributive 
with respect to vector addition. That is, if s is a scalar, and
(a, b) and (c, d) are vectors, then s • [(a, b) + (c, d)] =
s • (a, b) + s • (c, d). To prove this rule, observe first that
the expression on the left instructs us to do the vector addition
first, and the scalar multiplication afterwards. So
s • [(a, b) + (c, d)] = s • (a + c, b + d) = (s • [a + c],
s • [b + d]) = (s • a + s • c, s • b + s • d). The expression
on the right instructs us to do two scalar multiplications first,
and then add the results by vector addition. Following
these instructions, we find that s • (a, b) + s • (c, d) =
(s • a, s • b) + (s • c, s • d) = (s • a + s • c, s • b + s • d).
The proof is completed by observing that the two results
are the same.

Scalar multiplication also satisfies another distributive 
law. It is distributive with respect to the addition of scalars. 
That is, if s and t are scalars, and (a, b) is a vector, then
(s + t) • (a, b) = s • (a, b) + t • (a, b). To prove this rule,
we observe that carrying out the indicated operations on
both sides of the equals sign leads to the same result:
(s + t) • (a, b) = ([s + t] • a, [s + t] • b) = (s • a + t • a,
s • b + t • b), and s • (a, b) + t • (a, b) = (s • a, s • b) +
(t • a, t • b) = (s • a + t • a, s • b + t • b).

Scalar multiplication also obeys a mixed associative law,
in which two types of multiplication appear: 1) scalar
multiplication, in which a scalar multiplies a vector, and
2) multiplication of scalars, in which a scalar multiplies a
scalar. This law says that, if s and t are scalars, and (a, b)
is a vector, then (s • t) • (a, b) = s • [t • (a, b)]. To prove
this law, observe that (s • t) • (a, b) = ([s • t] • a, [s • t]
• b) = (s • t • a, s • t • b). But s • [t • (a, b)] = s • (t • a,
t • b) = (s • [t • a], s • [t • b]) = (s • t • a, s • t • b).

Scalar multiplication also has the property that the num- 
ber 1, which is the unity element for multiplication of a 
scalar times a scalar, is also the unity element for multipli- 
cation of a scalar times a vector. This is seen from the fact 
that, if (a, b) is a vector, 1 • (a, b) = (1 • a, 1 • b) = 
(a, b). 
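
The laws of scalar multiplication proved above can also be spot-checked numerically. The sketch below is an illustration only: the function names scale and add and the sample values of s, t, v, and w are assumptions made here, and whole numbers are used so that the comparisons are exact.

```python
def add(p, q):
    """Vector addition: add the components separately."""
    return (p[0] + q[0], p[1] + q[1])


def scale(s, p):
    """Scalar multiplication: multiply each component by the scalar s."""
    return (s * p[0], s * p[1])


# Doubling the arrow (2, 1), as in the text.
print(scale(2, (2, 1)))  # (4, 2)

s, t = 3, -2            # sample scalars
v, w = (2, 1), (4, -5)  # sample vectors

# Distributive over vector addition: s(v + w) = sv + sw
print(scale(s, add(v, w)) == add(scale(s, v), scale(s, w)))  # True

# Distributive over scalar addition: (s + t)v = sv + tv
print(scale(s + t, v) == add(scale(s, v), scale(t, v)))      # True

# Mixed associative law: (st)v = s(tv)
print(scale(s * t, v) == scale(s, scale(t, v)))              # True

# Unity element: 1v = v
print(scale(1, v) == v)                                      # True
```

A check of this kind confirms the laws only for the values tried; the proofs in the text are what establish them for all scalars and vectors.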

We shall now summarize these properties of scalar mul- 
tiplication in a new abbreviated notation. So far we have 
always written a vector as an ordered pair, in which its two 
components are put on display. In the abbreviated notation, 
we represent a vector by a single symbol, with a little arrow
over it to remind us that it stands for a vector. Thus, $\vec{x}$, $\vec{y}$,
and $\vec{z}$ represent vectors. Symbols like r, s, and t, written
without arrows, will represent scalars. In this notation, the
properties of scalar multiplication are expressed in this
form:

I. Distributive laws: $r \cdot (\vec{x} + \vec{y}) = r \cdot \vec{x} + r \cdot \vec{y}$

$(r + s) \cdot \vec{x} = r \cdot \vec{x} + s \cdot \vec{x}$

II. Mixed Associative law: $r \cdot (s \cdot \vec{x}) = (r \cdot s) \cdot \vec{x}$

III. Unity element: $1 \cdot \vec{x} = \vec{x}$

Because of the characteristics we have observed in the 
system of ordered pairs (pictured as arrows), this system 
serves as an example of the kind of mathematical structure 
that is called a vector space. The name vector space is used 
to describe any system of elements that has these prop- 
erties: 1. It is an abelian group. 2. There is associated with 
this group another system of elements that is a field, and is 
called the scalar field. A scalar multiplication exists, in 
which a scalar from the field multiplies a vector from the 
vector space, and the product is a vector in the vector space.
3. The scalar multiplication has properties I, II, and III 
listed above. 

Some Other Vector Spaces 

Vector spaces won recognition as a special kind of struc- 
ture worthy of separate study when mathematicians re- 
alized that there are many systems that have this kind of 
structure. In the example that we have been observing, 
each element in the vector space was an ordered pair of real 
numbers. We can construct another vector space, by using 
as elements ordered triples instead, like (1, 3, —2). In 
this system, each element has three components. We could 
define vector addition and scalar multiplication for this 
system in the same way that we did for the system of 
ordered pairs: To add two vectors, add their corresponding
components; to multiply by a scalar, multiply each
component separately by that scalar. For example, (2, 3,
-5) + (1, 4, 2) = (3, 7, -3). And 2 • (2, 3, -5) = (4, 6,
-10). With these definitions of vector addition and scalar
multiplication, the system of triples meets all the require- 
ments for being a vector space. The system of ordered pairs 
is an example of a vector space of two dimensions, and is 
represented pictorially by points in a plane. The system of 
ordered triples is an example of a vector space of three 
dimensions, and can be represented pictorially by the points 
in three dimensional space. By using ordered quadruples as 
elements, we can construct a vector space of four dimen- 
sions. In general, we can construct a vector space of n 
dimensions by using as elements ordered sets of real numbers
with n numbers in each set. If we take n = 1, the
vectors are single real numbers, and the vector space is one 
dimensional. In other words, the real number system may 
be thought of as a vector space of one dimension whose 
associated scalar field is also the real number system. 
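
The same two definitions work for any number of components. As a rough sketch (the names vec_add and vec_scale are invented here for illustration), a single pair of Python functions can serve for vectors of every dimension n, provided the two vectors being added have the same number of components:

```python
def vec_add(u, v):
    """Add two vectors of the same dimension, component by component."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same number of components")
    return tuple(a + b for a, b in zip(u, v))


def vec_scale(s, v):
    """Multiply every component of v by the scalar s."""
    return tuple(s * a for a in v)


# The ordered triples used in the text (three dimensions).
print(vec_add((2, 3, -5), (1, 4, 2)))  # (3, 7, -3)
print(vec_scale(2, (2, 3, -5)))        # (4, 6, -10)

# With n = 1 the vectors are single real numbers.
print(vec_add((5,), (7,)))             # (12,)
```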

Another familiar system that has the structure of a
vector space is the system of all polynomials whose terms
are powers of x multiplied by some real number. Typical
elements in this system look like this:

$5x^4 + 3x^2 - 2x + 7$

$\sqrt{2}\,x^2 - 3x + 9$

The two polynomials shown here are of the fourth and 
second degree respectively. There are also polynomials that 
do not contain x at all. They are just ordinary real numbers 
like 5, 7, -4, $\sqrt{3}$, and so on. Each of them except 0 is called
a polynomial of zero degree. (No degree is assigned to 0.) 
In this system, each polynomial is a vector. The real number 
system is the associated field of scalars. Vector addition is 
carried out by adding polynomials according to the rules 
taught in elementary algebra. The zero element for this ad- 
dition is 0, a polynomial of no degree. Scalar multiplication 
is carried out by multiplying a polynomial by a real number
according to the rules taught in elementary algebra. With 
these two operations, the system of polynomials meets all 
the requirements for being a vector space. In this vector 
space, each distinct power of x is a separate component. Any 
one polynomial contains only a finite number of components.
But, since any positive power of x, like $x^9$ or $x^{187}$,
may be used in a polynomial, there is an infinite number of 
components to choose from in constructing a polynomial. 
Therefore this system is an example of a vector space of 
infinite dimension. 
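
One way to make the idea of "components" of a polynomial concrete is to store only its coefficients. The sketch below is an assumption about representation, not the book's own method: a polynomial is kept as a Python list of coefficients with the constant term first, and the names poly_add and poly_scale are invented for the purpose.

```python
from itertools import zip_longest


def poly_add(p, q):
    """Add two polynomials given as coefficient lists (constant term first).

    The shorter list is padded with zeros, since a missing power of x
    has coefficient 0.
    """
    return [a + b for a, b in zip_longest(p, q, fillvalue=0)]


def poly_scale(s, p):
    """Multiply every coefficient of p by the real number s."""
    return [s * a for a in p]


p = [7, -2, 3, 0, 5]  # 5x^4 + 3x^2 - 2x + 7
q = [1, 4]            # 4x + 1

print(poly_add(p, q))    # [8, 2, 3, 0, 5], that is, 5x^4 + 3x^2 + 2x + 8
print(poly_scale(2, p))  # [14, -4, 6, 0, 10]
```

Each list is finite, but there is no limit on how long a list may be, which is the sense in which the space has infinite dimension. This simple version does not trim trailing zeros, so the zero polynomial may appear as a list of several zeros.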

The i, j, k Notation 

In the study of physics it is found that such things as 
forces, velocities, and rotations can be represented as vec- 
tors in a three dimensional vector space. To carry out com- 
putations in this vector space the physicist uses a special 
notation in which all vectors are expressed in terms of 
three unit vectors called i, j, and k. They are defined as 
follows: i = (1, 0, 0); j = (0, 1, 0); k = (0, 0, 1). The vector
(2, 3, 5) can be expressed in terms of these units in this
way: (2, 3, 5) = (2, 0, 0) + (0, 3, 0) + (0, 0, 5) = 2 • (1,
0, 0) + 3 • (0, 1, 0) + 5 • (0, 0, 1) = 2i + 3j + 5k.
In general, the vector (a, b, c) = ai + bj + ck. When this
notation is used, all steps in vector addition and scalar 
multiplication become simple exercises in elementary high 
school algebra. For example: 

Vector addition:

(2i + 3j + 5k) + (3i - 4j + 2k) = 5i - j + 7k

Scalar multiplication:

2(2i + 3j + 5k) = 4i + 6j + 10k

Mapping the Plane Into Itself 

On page 131, we saw that multiplying the vector (2, 1) 
by the scalar 2 has the effect of doubling the length of the 
associated arrow without changing its direction. If we multiply
each of the vectors in the plane by 2, every one of the
associated arrows has its length doubled. The effect of
the multiplication is to stretch the entire plane uniformly
in all directions, so that each point of the plane is pulled to
a new position, twice as far from the origin as it was before.
We can represent a typical vector by the symbol (x, y),
where x is its x component, and y is its y component. As a
result of the stretching, the vector (x, y) is changed into
the vector (2x, 2y). If we call the components of the new
vector x' and y', the relationship between the old components
and the new components is expressed in the equations:
x' = 2x, and y' = 2y.

[Diagram: the solid arrow represents (x, y); the shaded arrow represents (x', y').]

The stretching of the plane is an example of a mapping 
of the vector space into itself, whereby each element in 
the vector space is mapped into a particular image. Such a 
mapping is called a transformation. The equations above 
serve to define the transformation precisely, by giving di- 
rections for calculating, from the components of any vector, 
the components of the image into which it is mapped. If 
the stretching of the plane is one in which each arrow has its 
length tripled, the equations are x' = 3x and y' = 3y. If
we contract the plane, to shrink each arrow to half its size,
the equations are x' = (1/2)x, and y' = (1/2)y. If we reverse the
direction of each arrow in the vector space, the equations
are x' = -x, and y' = -y. Such a reversal of direction is
called a reflection. In general, equations of the form x' = kx
and y' = ky, where k is some fixed real number, define a
stretching or contraction of all vectors, with or without a 
reversal of direction. 

There are other transformations, too, that can be de- 
scribed pictorially in terms of motions or deformations of 
the plane. Each is defined precisely by an associated pair 
of equations that shows how the new components (of the 
image vectors under the mapping) are computed from 
the old components. The equations x' = 3x, y' = y define a
transformation which stretches the plane horizontally, 
while leaving each point as close to the x axis as it was 
before. The equations x' = x + 2y, y' = y define a trans- 
formation which also leaves points at the same height above 
or below the x axis, while moving them horizontally through
varying distances that depend on their height. As a result 
this mapping moves the points of the y axis into new posi- 
tions on a sloping line, as shown in the first diagram. The 
effect is a shearing of the plane, which deforms rectangles 
into parallelograms, as shown in the second diagram. The 
equations $x' = x\cos\theta - y\sin\theta$, $y' = x\sin\theta + y\cos\theta$ define
a rotation of the plane around the origin as center. All these
mappings are special examples of a particular family of
transformations whose equations have this simple form:

$x' = a_1 x + a_2 y$

$y' = b_1 x + b_2 y$

Mappings that can be written in this form, where $a_1$, $a_2$, $b_1$,
and $b_2$ may be any fixed real numbers, are called linear mappings.
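
Since a linear mapping is fixed completely by its four coefficients, the stretch, the shear, and the rotation described above can all be produced by one small routine. The following Python sketch is illustrative only; the helper names linear_map and rotation are assumptions made here, and the rotation result is subject to the usual small rounding errors of floating-point arithmetic.

```python
import math


def linear_map(a1, a2, b1, b2):
    """Return the mapping x' = a1*x + a2*y, y' = b1*x + b2*y as a function."""
    def apply(v):
        x, y = v
        return (a1 * x + a2 * y, b1 * x + b2 * y)
    return apply


stretch = linear_map(2, 0, 0, 2)  # x' = 2x,     y' = 2y
widen = linear_map(3, 0, 0, 1)    # x' = 3x,     y' = y
shear = linear_map(1, 2, 0, 1)    # x' = x + 2y, y' = y


def rotation(theta):
    """Rotation of the plane about the origin through the angle theta (radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return linear_map(c, -s, s, c)


print(stretch((2, 1)))  # (4, 2)
print(widen((2, 1)))    # (6, 1)
print(shear((0, 1)))    # (2, 1): a point of the y axis moves onto a sloping line
print(shear((3, 0)))    # (3, 0): points on the x axis stay where they are

quarter_turn = rotation(math.pi / 2)
x, y = quarter_turn((1, 0))
print(round(x, 9), round(y, 9))  # approximately 0 and 1
```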

Another kind of mapping is obtained when the entire 
plane is moved in the direction of some particular vector 
through a distance equal to the length of that vector. Such 
a mapping is called a translation. For example, the equations
x' = x + 2, y' = y + 3 define a translation that moves
every point in the plane in the direction and through the
distance specified by the vector (2, 3). In general, equations
of the form x' = x + h, y' = y + k, where h and k are fixed
real numbers, represent translations of the plane. A more 
complicated type of transformation is obtained when a 
linear mapping is followed by a translation. Then the equa- 
tions look like this: 

$x' = a_1 x + a_2 y + h$

$y' = b_1 x + b_2 y + k$

Reversible Mappings 

The mapping defined by the equations x' = x + 2, y' = 
y + 3 moves every point in the plane two units to the right 
and three units up. This mapping is easily reversed. If, after 
the mapping, each point is moved two units to the left and 
three units down, the points all return to their original 
positions. The reversibility of the mapping can be expressed
in terms of the equations, too. The equations show
how to calculate the new components x' and y' from the old 
components x and y. If we solve the equations for x and y, 
we get equations in this form: x = x' — 2, y = y' — 3. 
These equations give directions for calculating the old 
components from the new, so they are the equations of the 
reverse mapping. In this reverse mapping, considered as a 
mapping in its own right, x' and y' are the components of 
a vector in its initial position, and x and y are the com- 
ponents in the final position. To conform to the original 
notation, in which the primed symbols are used to repre- 
sent the new position, we represent the reverse mapping 
in this form: x' = x - 2, y' = y - 3. In general, for every
translation, represented by the equations x' = x + h, y' = 
y + k there is a reverse translation whose equations are 
x' = x — h, y' = y — k. 

The Translations Form a Group 

In the equations x' = x + h, y' = y + k, every time we
pick definite values for h and k, we have equations representing
some definite translation. For example, the pair of
equations x' = x + 2, y' = y + 3 represents one translation.
Let us call this translation P. The pair of equations x' =
x + 5, y' = y + 7 represents another translation. Let us call it
Q. If we take all possible values for the numbers h and k, 
we get all possible translations of the plane into itself. 
These translations form a system of elements in which an 
operation of multiplication can be defined. 

Just as we did in the case of the symmetries of the tri- 
angle, back in Chapter III, we define the product of two 
translations as the result of performing one translation after 
another, with the second one taking over where the first 
one leaves off. The combined effect of the two translations 
is equivalent to a single transformation, which, in fact, is 
also a translation. 

For example, suppose we designate by Q * P the transformation
we get when P is performed first, and Q is performed
next on the results of P. P transforms components
x and y by adding 2 and 3 to them respectively, so the new
components are x + 2, and y + 3. Q transforms these components
by adding 5 and 7 to them, respectively, so the final
components are x + 7 and y + 10. So the equations for the
product Q * P are: x' = x + 7, y' = y + 10. These are of
the form x' = x + h, y' = y + k, with the special values
7 and 10 for h and k. So Q * P also belongs to the system of
translations. Because the product of two translations is
itself a translation, the operation * is a binary operation
in the system of translations. The operation is associative,
because, if P, Q, and R are three translations, (R * Q) * P
and R * (Q * P) both mean P followed by Q followed by R.
The translation defined by the equations x' = x + 0, y' =
y + 0 doesn’t move the plane at all, and is the identity
element for the operation *. If we designate it by I, then
P * I = I * P = P. We have already observed that every
translation is reversible. Let us designate by $P^{-1}$ the translation
that is the reverse of P. If it is applied after P, it brings
every point in the plane back to its original position. So
the product $P^{-1} * P$ is equal to the identity element I.
Similarly, $P * P^{-1} = I$. So $P^{-1}$ is the inverse of P with respect
to the operation *. Because of these properties, the
system of translations constitutes a group with respect to
the operation *. In fact, it is an abelian group, because the
operation turns out to be commutative.
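
The group properties of the translations can be tried out directly. In the Python sketch below, which is only an illustration, a translation is recorded by its two numbers h and k; the class name Translation and its inverse method are inventions of this sketch, and Python's own * sign is borrowed to stand for the operation * of the text, so that Q * P means "P first, then Q."

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Translation:
    """The translation x' = x + h, y' = y + k."""
    h: float
    k: float

    def __call__(self, v):
        """Move the point v = (x, y) to its new position."""
        return (v[0] + self.h, v[1] + self.k)

    def __mul__(self, other):
        """self * other: perform `other` first, then `self`."""
        return Translation(self.h + other.h, self.k + other.k)

    def inverse(self):
        """The reverse translation x' = x - h, y' = y - k."""
        return Translation(-self.h, -self.k)


I = Translation(0, 0)  # the identity: moves nothing
P = Translation(2, 3)
Q = Translation(5, 7)

print(Q * P)                 # Translation(h=7, k=10)
print((Q * P)((1, 1)))       # (8, 11)
print(P * I == P == I * P)   # True
print(P.inverse() * P == I)  # True
print(Q * P == P * Q)        # True: the operation is commutative
```

The product is computed simply by adding the h's and adding the k's, which is why the order of the two translations makes no difference.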

The Linear Group 

In the equations

$x' = a_1 x + a_2 y$

$y' = b_1 x + b_2 y$,

every time we pick definite values for $a_1$, $a_2$, $b_1$, and $b_2$, we
get equations representing a particular linear mapping of the
plane into itself. If we take all possible real values of $a_1$, $a_2$,
$b_1$, and $b_2$, we get all possible linear mappings. They form a
system of elements in which the operation *, defined in the
same way as for translations, is a binary operation, because
the product of two linear mappings is itself a linear mapping.
For example, if P stands for the mapping

x' = 2x + 3y

y' = 5x - y

and Q stands for the mapping

x" = 3x' + 4y'

y" = 2x' + 7y'

we get the mapping Q * P by applying the mapping Q to the
results of P as follows:

x" = 3(2x + 3y) + 4(5x - y) = 6x + 9y + 20x - 4y

y" = 2(2x + 3y) + 7(5x - y) = 4x + 6y + 35x - 7y.

So the transformation Q * P has the equations

x" = 26x + 5y

y" = 39x - y

These are the equations of a linear mapping in which $a_1$, $a_2$,
$b_1$, and $b_2$ have the special values 26, 5, 39 and -1 respectively.
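
The substitution carried out above follows a fixed pattern, so it can be written once and reused. In this sketch, offered only as an illustration, a linear mapping is stored as the tuple of its four coefficients ($a_1$, $a_2$, $b_1$, $b_2$); the function names compose and apply are assumptions of the sketch.

```python
def apply(m, v):
    """Apply the linear mapping m = (a1, a2, b1, b2) to the vector v = (x, y)."""
    a1, a2, b1, b2 = m
    x, y = v
    return (a1 * x + a2 * y, b1 * x + b2 * y)


def compose(q, p):
    """Coefficients of the product q * p (p performed first, then q).

    Obtained by substituting the equations of p into those of q and
    collecting the terms in x and in y.
    """
    pa1, pa2, pb1, pb2 = p
    qa1, qa2, qb1, qb2 = q
    return (qa1 * pa1 + qa2 * pb1, qa1 * pa2 + qa2 * pb2,
            qb1 * pa1 + qb2 * pb1, qb1 * pa2 + qb2 * pb2)


P = (2, 3, 5, -1)  # x' = 2x + 3y,  y' = 5x - y
Q = (3, 4, 2, 7)   # x" = 3x' + 4y', y" = 2x' + 7y'

print(compose(Q, P))                 # (26, 5, 39, -1)

# Applying Q * P in one step agrees with applying P and then Q.
print(apply(compose(Q, P), (1, 2)))  # (36, 37)
print(apply(Q, apply(P, (1, 2))))    # (36, 37)
```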

However, the system of all linear mappings does not form 
a group. It fails to qualify as a group because not all the 
linear mappings are reversible. The reason for this diffi- 
culty can be seen by comparing one of the troublesome 
linear mappings with a translation. A translation maps each 
point into an image in such a way that no two points have 
the same image. We can reverse the translation by carrying 
each point back to the single point of which it is the image. 
However, the linear mapping defined by the equations 

x' = 0x + 0y = 0

y' = 0x + 0y = 0,

behaves differently. It carries all points into the origin. 
As a result, the origin is the image of not one point, but 
many points. A many-to-one mapping, as we saw on page
15, is not reversible.

However, there are some linear mappings that are re- 
versible. If we put aside those linear mappings that are not 
reversible, and keep only those that are reversible, then we 
get a subset of the system of linear mappings that does 
constitute a group. In this subset, every linear mapping has 
an inverse, and all the other requirements for qualifying as 
a group are satisfied. We can find out which linear mappings 
are reversible by actually trying to reverse one, and seeing 
what conditions must be fulfilled to attain success. Suppose 
we take the equations of a linear mapping in general form:

$x' = a_1 x + a_2 y$

$y' = b_1 x + b_2 y$.

To reverse the mapping means to solve for x and y in terms
of x' and y'. Let us first write the equations with the x'
and y' on the right hand side, and then solve for x and y
by the usual method for solving simultaneous equations by
eliminating one of the unknowns. To eliminate y, we multiply
the first equation by $b_2$, multiply the second equation
by $-a_2$, and then add the resulting equations:

$a_1 b_2 x + a_2 b_2 y = b_2 x'$

$-a_2 b_1 x - a_2 b_2 y = -a_2 y'$

$a_1 b_2 x - a_2 b_1 x = b_2 x' - a_2 y'$

The distributive law permits us to rewrite the left hand
side of the equation in factored form:
$(a_1 b_2 - a_2 b_1)x = b_2 x' - a_2 y'$. The next step in solving for x would be to divide
both sides of the equation by $(a_1 b_2 - a_2 b_1)$ to get as a result

$x = \frac{b_2 x' - a_2 y'}{a_1 b_2 - a_2 b_1}$

However, this step is not always possible. We know from
our discussion on page 74 that division makes sense only
when the divisor is not zero. So we can succeed in reversing
the linear mapping if and only if $a_1 b_2 - a_2 b_1$ is not equal
to zero. The number $a_1 b_2 - a_2 b_1$ is called the determinant
of the linear mapping. So we can say that a linear mapping
is reversible if and only if its determinant is different from
zero. For example, suppose linear mappings A and B are
defined as follows: 

A:  x' = 2x + 3y          B:  x' = 2x - 3y

    y' = 4x + 6y              y' = 3x + y

The determinant of mapping A is 2 • 6 - 3 • 4 = 0, so A is
not a reversible mapping. The determinant of mapping B
is 2 • 1 - (-3) • 3 = 11, which is different from zero, so B
is a reversible mapping. In fact, if we solve for x and y in 
terms of x' and y', we get 


$x = \frac{1}{11}x' + \frac{3}{11}y'$

$y = -\frac{3}{11}x' + \frac{2}{11}y'$

These equations have the right form to qualify as a linear
mapping, which we may designate as $B^{-1}$, or the inverse of
B. Moreover, $B^{-1}$ is reversible, because solving for x' and
y' leads back again to the original equations of B.
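
The test for reversibility and the computation of the inverse can be sketched in a few lines. This is an illustration under stated assumptions rather than the book's procedure: a mapping is again stored as its four coefficients ($a_1$, $a_2$, $b_1$, $b_2$), the function names are invented here, the formula for y is obtained by eliminating x in the same way that the text eliminates y, and Python's Fraction type is used so that the elevenths come out exactly.

```python
from fractions import Fraction


def determinant(m):
    """Determinant a1*b2 - a2*b1 of the mapping m = (a1, a2, b1, b2)."""
    a1, a2, b1, b2 = m
    return a1 * b2 - a2 * b1


def inverse(m):
    """Coefficients of the reverse mapping; fails if the determinant is zero."""
    a1, a2, b1, b2 = m
    det = determinant(m)
    if det == 0:
        raise ValueError("determinant is zero: the mapping is not reversible")
    d = Fraction(det)
    # x = (b2*x' - a2*y') / det,   y = (-b1*x' + a1*y') / det
    return (b2 / d, -a2 / d, -b1 / d, a1 / d)


def apply(m, v):
    """Apply the mapping m to the vector v = (x, y)."""
    a1, a2, b1, b2 = m
    x, y = v
    return (a1 * x + a2 * y, b1 * x + b2 * y)


A = (2, 3, 4, 6)   # x' = 2x + 3y, y' = 4x + 6y
B = (2, -3, 3, 1)  # x' = 2x - 3y, y' = 3x + y

print(determinant(A))  # 0
print(determinant(B))  # 11
print(inverse(B))      # the fractions 1/11, 3/11, -3/11, 2/11

# Applying B and then its inverse returns a point to where it started.
print(apply(inverse(B), apply(B, (4, 5))) == (4, 5))  # True
```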

The system of reversible linear mappings is known as 
the linear group. It includes all rotations, stretches, and 
shears of the plane. 

If a reversible linear mapping is followed by a transla- 
tion, the product is called an affine transformation. The sys- 
tem of all such products also turns out to be a group, and 
is known as the affine group. If a uniform stretching of the 
plane is followed by a translation, the product is called a 
similitude. The system of all such products constitutes an- 
other group, known as the group of similitudes. If we put 
together in one set of transformations all rotations, trans- 
lations, and reflections, this set, too, is a group, and is known 
as the Euclidean group. 

What Is Geometry?

The fact that transformations of a plane into itself can 
be associated with each other in families of transforma- 
tions, some of which have a group structure, has led to a 
new insight into the meaning of geometry. In the geometry 
we study in high school, considerable time is devoted to 
the study of congruent figures. We try to find out what 
figures are congruent to each other. We also investigate 
properties of a figure that it has in common with any other 
figure to which it is congruent. Such properties include 
lengths of corresponding lines, sizes of corresponding angles, 
area, etc. Two figures were defined as being congruent if 
they could be made to coincide. This definition implied the 
use of a motion to carry one figure onto the other. To assure 
ourselves that the figure would not be deformed while it 
was being moved, we calmed our fears with the “axiom” 
that a geometric figure can be moved from place to place 
without changing its form or size. The effect of this axiom 
was to banish from the realm of legitimate motions all 
stretches and shears. At the same time, it singled out as 
the only legitimate motions those that we have called rota- 
tions, translations and reflections. But these are the motions 
that make up the Euclidean group of transformations. This 
fact makes it possible to define more precisely what is meant
by congruence. Two figures are congruent if one can be 
mapped onto the other by a transformation that belongs 
to the Euclidean group. This definition also gives a new 
meaning to such concepts as length, area, and so on. They 
turn out to be among the characteristics of a figure that 
remain unchanged when it is transformed by a mapping 
that belongs to the Euclidean group. 

Another subject treated in high school geometry is that 
of similar figures. The concept of similarity can also be de- 
fined in terms of a group of transformations. Two figures 
are similar if one can be mapped onto the other by means 
of a transformation that belongs to the group of similitudes. 
A similitude does not leave the length of a line unchanged, 
but it does leave unchanged such things as angles, and
ratios of lengths. The fact that the two traditional concerns 
of plane geometry can be described best in terms of groups 
of transformations has led to the modern notion of what a 
geometry is. A geometry is now defined as the study of 
figures which can be mapped into each other by a group of
transformations, and of the properties of figures that remain 
unchanged when the transformations in the group are ap- 
plied. In the sense of this definition, what we studied in 
high school was not geometry, but some geometries. When 
we studied congruence, we were studying Euclidean geometry,
associated with the Euclidean group. When we studied
similarity, we were studying a different geometry, associ- 
ated with the group of similitudes. Moreover, there are 
other geometries which we did not study in high school at 
all. For example, there is an affine geometry associated with
the affine group. Because there are many groups that may 
operate on the same vector space, there is a multiplicity of 
geometries belonging to one and the same space.* 

*For more about the many geometries that have evolved see A New
Look at Geometry, by Irving Adler (New York, The John Day Company,
1966).

DO IT YOURSELF

1. Use the definition of vector addition given on page 130
to find the following vector sums:

(3, 2) + (-1, 2)    (8, -5) + (-5, 8)

(4, 7) + (-4, -7)    (2, 0) + (0, 3)

2. Use the definition of vector addition to prove that
(a, b) + (c, d) = (c, d) + (a, b). (Commutative Law of
Addition)

3. Locate the points (2, 3), (-5, 4), and (-6, -2) on a
graph with coordinates measured from an x axis and a y
axis that are perpendicular to each other. Carry out the
following scalar multiplications, and locate the products
on the graph: 3 • (2, 3); 2 • (-5, 4); -1/2 • (-6, -2).
Verify from the graph that the first two scalar multiplications
change the length of the arrow belonging to the
vector, without changing its direction. Verify that the
third one changes the length and reverses the direction.

4. Represent these three-dimensional vectors in the i, j, k
notation:

(4, 6, 1)    (-2, 1, 0)    (0, 5, 7)

5. Two linear mappings P and Q (mapping a plane into
itself) are defined as follows. Find the equations that
define the product Q * P.

P:  x' = x + y
    y' = x - y

Q:  x' = 2x + y
    y' = x - 3y

Find the equations that define the product P * Q. Is
Q * P = P * Q? Is the operation * for such linear mappings
commutative?

6. Find the inverse of the mapping P defined in question 5. 

CHAPTER VII


The Rank and File: 
Matrices 


IN THIS chapter we re-examine from another point 
of view the linear mappings of a plane into itself. In the 
course of this re-examination we shall get acquainted with 
another type of mathematical structure that is important. 
At the same time we shall acquire some equipment that will 
be useful to us in the next chapter where we finish the job 
of constructing a number system that supplies a number 
for every point in the plane. 

Some typical linear mappings of the plane are listed be- 
low, with a capital letter assigned to each one as its name : 


P: x' = 1x + 1y 
   y' = 1x - 1y 

O: x' = 0x + 0y 
   y' = 0x + 0y 

Q: x' = 2x + 1y 
   y' = 1x - 3y 

I: x' = 1x + 0y 
   y' = 0x + 1y 


The mapping called O could have been written more briefly 
in the form x' = 0, y' = 0. Similarly, the mapping called I 
could have been written as x' = x, y' = y. However, showing 
the zero coefficients explicitly, as we have in the longer 
way of writing them, has the advantage of stressing the fact 
that all linear mappings have the same form. In each map- 
ping, the new x and the new y are obtained by adding some 
multiple of the old x to some multiple of the old y. Two 
different linear mappings differ from each other only by 
virtue of the fact that they use different multiples. What 
distinguishes one linear mapping from another, then, is the 
set of four coefficients that appear in the equations of the 
mapping. This fact suggests that we can represent a mapping 
in an abbreviated notation in which we delete the letters x, 
y, x' and y', and merely list these four coefficients, arranging 
them in a square array, just as they are arranged in the 
equations written above. Such a square array of numbers 
is called a matrix. In this case, because the matrix has two 
rows and two columns, we call it a 2 by 2 matrix. Every 
linear mapping of the plane is associated with such a 2 by 2 
matrix, and, vice versa, every 2 by 2 matrix of real numbers 
belongs to some linear mapping of the plane. To show this 
correspondence, we shall use as the name of a matrix the 
name of the mapping that it belongs to. Here are the 
matrices of the mappings P, Q, O, and I: 



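\[
P = \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}, \quad
Q = \begin{pmatrix} 2 & 1 \\ 1 & -3 \end{pmatrix}, \quad
O = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}, \quad
I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}
\]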
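
The correspondence between a mapping and its matrix can be tried out in a few lines of Python. This is only an illustrative sketch: the helper name apply and the choice to store a matrix as a pair of rows are assumptions made here, not notation from the text.

```python
# Illustrative sketch: a 2 by 2 matrix is stored as a pair of rows.
# The helper name `apply` is an assumption made for this example.
def apply(matrix, point):
    """Map (x, y) to (x', y') using the four coefficients of the matrix."""
    (a1, a2), (b1, b2) = matrix
    x, y = point
    return (a1 * x + a2 * y, b1 * x + b2 * y)

P = ((1, 1), (1, -1))
Q = ((2, 1), (1, -3))
O = ((0, 0), (0, 0))
I = ((1, 0), (0, 1))

for name, matrix in (("P", P), ("Q", Q), ("O", O), ("I", I)):
    print(name, apply(matrix, (2, 3)))
```

The mapping O sends every point to (0, 0), and the mapping I leaves every point where it is, which is what the equations x' = 0, y' = 0 and x' = x, y' = y say.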


Matrices Form a Vector Space 

Now that we have a collection of matrices, we can disre- 
gard their origin as sets of coefficients belonging to linear 
mappings, and may think of them as constituting an inde- 
pendent system of elements. We proceed to give this system 
a structure in a simple and natural way. We convert it into 
a vector space by methods like those we used in the last 
chapter. Just as the vector (2, 3) is made up of two com- 
ponents, each distinguished by the place it occupies in the 
ordered pair, each matrix is made up of four components, 
each distinguished by the place it occupies in the square 
array. So we can define addition of matrices the way we 
defined addition of ordered pairs. To add two matrices, add 
the corresponding components separately. We can also de- 
fine a scalar multiplication for matrices, using the field of 
real numbers as the field of scalars, in the same way that we 
defined scalar multiplication for ordered pairs. To multiply 
a matrix by a scalar, multiply each of its components by 
that scalar. Examples of addition and scalar multiplication 
for matrices are shown below: 


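\[
P + Q = \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix} + \begin{pmatrix} 2 & 1 \\ 1 & -3 \end{pmatrix}
= \begin{pmatrix} 1 + 2 & 1 + 1 \\ 1 + 1 & (-1) + (-3) \end{pmatrix}
= \begin{pmatrix} 3 & 2 \\ 2 & -4 \end{pmatrix}
\]

\[
2 \cdot Q = 2 \cdot \begin{pmatrix} 2 & 1 \\ 1 & -3 \end{pmatrix}
= \begin{pmatrix} 2(2) & 2(1) \\ 2(1) & 2(-3) \end{pmatrix}
= \begin{pmatrix} 4 & 2 \\ 2 & -6 \end{pmatrix}
\]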
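
The two definitions translate directly into code. The sketch below assumes matrices are stored as tuples of rows; the function names add and scale are chosen here for illustration only.

```python
# Illustrative sketch: component-by-component addition and scalar multiplication.
def add(m, n):
    """Add corresponding components of two matrices of the same shape."""
    return tuple(tuple(x + y for x, y in zip(row_m, row_n))
                 for row_m, row_n in zip(m, n))

def scale(r, m):
    """Multiply every component of the matrix m by the scalar r."""
    return tuple(tuple(r * x for x in row) for row in m)

P = ((1, 1), (1, -1))
Q = ((2, 1), (1, -3))

print(add(P, Q))    # the sum P + Q
print(scale(2, Q))  # the scalar product 2 . Q
```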


With addition and scalar multiplication defined in this 
way, the system of 2 by 2 matrices becomes a vector space. 
To verify this fact, we have to show that it has all the 
characteristics of a vector space, listed on page 133. First, 
we observe that it is an abelian group with respect to addi- 
tion. This is proved below by showing that addition is asso- 
ciative and commutative, that it has a zero element, and 
that every matrix has a negative: 

Addition is associative: 


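\[
\text{Let } P = \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}, \quad
Q = \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix}, \quad
R = \begin{pmatrix} e_1 & e_2 \\ f_1 & f_2 \end{pmatrix}.
\]

\[
(P + Q) + R = \begin{pmatrix} a_1 + c_1 & a_2 + c_2 \\ b_1 + d_1 & b_2 + d_2 \end{pmatrix} + \begin{pmatrix} e_1 & e_2 \\ f_1 & f_2 \end{pmatrix}
= \begin{pmatrix} (a_1 + c_1) + e_1 & (a_2 + c_2) + e_2 \\ (b_1 + d_1) + f_1 & (b_2 + d_2) + f_2 \end{pmatrix}
\]

\[
P + (Q + R) = \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + \begin{pmatrix} c_1 + e_1 & c_2 + e_2 \\ d_1 + f_1 & d_2 + f_2 \end{pmatrix}
= \begin{pmatrix} a_1 + (c_1 + e_1) & a_2 + (c_2 + e_2) \\ b_1 + (d_1 + f_1) & b_2 + (d_2 + f_2) \end{pmatrix}
\]

But (a_1 + c_1) + e_1 = a_1 + (c_1 + e_1), etc., because addition 
of real numbers is associative. Therefore (P + Q) + R = 
P + (Q + R). 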

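Addition is commutative: 

\[
P + Q = \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix}
= \begin{pmatrix} a_1 + c_1 & a_2 + c_2 \\ b_1 + d_1 & b_2 + d_2 \end{pmatrix}
\]

\[
Q + P = \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix} + \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} c_1 + a_1 & c_2 + a_2 \\ d_1 + b_1 & d_2 + b_2 \end{pmatrix}
\]

But a_1 + c_1 = c_1 + a_1, etc., because addition of real numbers 
is commutative. Therefore P + Q = Q + P. 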

There is a zero element: 


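\[
O + P = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix} + \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} 0 + a_1 & 0 + a_2 \\ 0 + b_1 & 0 + b_2 \end{pmatrix}
= \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} = P
\]

Therefore the matrix 
\[
O = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}
\]
is the zero element for matrix addition, and is called the zero matrix. 

Every matrix has a negative: 

In fact, the negative of 
\[
\begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
\quad \text{is} \quad
\begin{pmatrix} -a_1 & -a_2 \\ -b_1 & -b_2 \end{pmatrix},
\]
because 
\[
\begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + \begin{pmatrix} -a_1 & -a_2 \\ -b_1 & -b_2 \end{pmatrix}
= \begin{pmatrix} a_1 + (-a_1) & a_2 + (-a_2) \\ b_1 + (-b_1) & b_2 + (-b_2) \end{pmatrix}
= \text{the zero matrix.}
\]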


We observe next that scalar multiplication for matrices 
obeys the two distributive laws, the mixed associative law, 
and the law that the number 1 serves as a unity element for 
scalar multiplication : 

Distributive laws: 


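\[
r(P + Q) = r\left[ \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix} \right]
= r \begin{pmatrix} a_1 + c_1 & a_2 + c_2 \\ b_1 + d_1 & b_2 + d_2 \end{pmatrix}
\]
\[
= \begin{pmatrix} r(a_1 + c_1) & r(a_2 + c_2) \\ r(b_1 + d_1) & r(b_2 + d_2) \end{pmatrix}
= \begin{pmatrix} ra_1 + rc_1 & ra_2 + rc_2 \\ rb_1 + rd_1 & rb_2 + rd_2 \end{pmatrix}
\]

\[
rP + rQ = r \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + r \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix}
= \begin{pmatrix} ra_1 & ra_2 \\ rb_1 & rb_2 \end{pmatrix} + \begin{pmatrix} rc_1 & rc_2 \\ rd_1 & rd_2 \end{pmatrix}
= \begin{pmatrix} ra_1 + rc_1 & ra_2 + rc_2 \\ rb_1 + rd_1 & rb_2 + rd_2 \end{pmatrix}
\]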


Therefore r(P + Q) = rP + rQ. 
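\[
(r + s)P = (r + s) \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} (r + s)a_1 & (r + s)a_2 \\ (r + s)b_1 & (r + s)b_2 \end{pmatrix}
= \begin{pmatrix} ra_1 + sa_1 & ra_2 + sa_2 \\ rb_1 + sb_1 & rb_2 + sb_2 \end{pmatrix}
\]

\[
rP + sP = r \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} + s \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} ra_1 & ra_2 \\ rb_1 & rb_2 \end{pmatrix} + \begin{pmatrix} sa_1 & sa_2 \\ sb_1 & sb_2 \end{pmatrix}
= \begin{pmatrix} ra_1 + sa_1 & ra_2 + sa_2 \\ rb_1 + sb_1 & rb_2 + sb_2 \end{pmatrix}
\]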

Therefore (r + s)P = rP + sP. 
Mixed associative law: 


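\[
r(sP) = r\left( s \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} \right)
= r \begin{pmatrix} sa_1 & sa_2 \\ sb_1 & sb_2 \end{pmatrix}
= \begin{pmatrix} r(sa_1) & r(sa_2) \\ r(sb_1) & r(sb_2) \end{pmatrix}
= \begin{pmatrix} rsa_1 & rsa_2 \\ rsb_1 & rsb_2 \end{pmatrix}
\]

\[
(rs)P = (rs) \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} (rs)a_1 & (rs)a_2 \\ (rs)b_1 & (rs)b_2 \end{pmatrix}
= \begin{pmatrix} rsa_1 & rsa_2 \\ rsb_1 & rsb_2 \end{pmatrix}
\]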

Therefore r(sP) = (rs)P. 

Unity element for scalar multiplication: 


\[
1 \cdot P = 1 \cdot \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} 1a_1 & 1a_2 \\ 1b_1 & 1b_2 \end{pmatrix}
= \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} = P
\]
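
The laws proved above can also be spot-checked numerically. The sketch below is not a proof, since it only tries a handful of sample matrices and scalars chosen for this illustration, but it makes each law concrete. The helper names are assumptions made here.

```python
# Illustrative spot check of the vector-space laws on a few sample matrices.
from itertools import product

def add(m, n):
    return tuple(tuple(x + y for x, y in zip(rm, rn)) for rm, rn in zip(m, n))

def scale(r, m):
    return tuple(tuple(r * x for x in row) for row in m)

ZERO = ((0, 0), (0, 0))
samples = [((1, 1), (1, -1)), ((2, 1), (1, -3)), ((0, 2), (0, 0)), ((5, -7), (3, 4))]
scalars = [-2, 0, 1, 3]

def laws_hold():
    """Return True if every law holds for every sample combination."""
    for p, q, r in product(samples, repeat=3):
        if add(add(p, q), r) != add(p, add(q, r)):        # associative
            return False
    for p, q in product(samples, repeat=2):
        if add(p, q) != add(q, p):                         # commutative
            return False
    for p in samples:
        if add(ZERO, p) != p or add(p, scale(-1, p)) != ZERO or scale(1, p) != p:
            return False                                   # zero, negative, unity
    for r, s in product(scalars, repeat=2):
        for p, q in product(samples, repeat=2):
            if scale(r, add(p, q)) != add(scale(r, p), scale(r, q)):
                return False                               # first distributive law
            if scale(r + s, p) != add(scale(r, p), scale(s, p)):
                return False                               # second distributive law
            if scale(r, scale(s, p)) != scale(r * s, p):
                return False                               # mixed associative law
    return True

print(laws_hold())
```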


Multiplication of Matrices 

In the last chapter we defined an operation of multipli- 
cation for linear mappings. Since each mapping has an 
associated matrix, we can transfer this operation to the 
matrices, and in this way define a multiplication operation 
for matrices. First, let us write down two matrices P and Q, 
and the mappings that are associated with them: 


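\[
P = \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
\qquad \text{associated mapping:} \qquad
\begin{aligned} x'' &= a_1 x' + a_2 y' \\ y'' &= b_1 x' + b_2 y' \end{aligned}
\]

\[
Q = \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix}
\qquad \text{associated mapping:} \qquad
\begin{aligned} x' &= c_1 x + c_2 y \\ y' &= d_1 x + d_2 y \end{aligned}
\]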


To get the product mapping P * Q, we first perform the 
mapping Q, and then perform the mapping P on the new 
components we get as a result of having used Q. 

\[
\begin{aligned}
x'' &= a_1(c_1 x + c_2 y) + a_2(d_1 x + d_2 y) = a_1 c_1 x + a_1 c_2 y + a_2 d_1 x + a_2 d_2 y \\
y'' &= b_1(c_1 x + c_2 y) + b_2(d_1 x + d_2 y) = b_1 c_1 x + b_1 c_2 y + b_2 d_1 x + b_2 d_2 y
\end{aligned}
\]

By rearranging the terms to bring the x terms together, and 
the y terms together, and then factoring out the x and y, 
we find that the product mapping P * Q takes this form: 

\[
\begin{aligned}
x'' &= (a_1 c_1 + a_2 d_1)x + (a_1 c_2 + a_2 d_2)y \\
y'' &= (b_1 c_1 + b_2 d_1)x + (b_1 c_2 + b_2 d_2)y
\end{aligned}
\]

The matrix that belongs to this mapping is 

\[
\begin{pmatrix} a_1 c_1 + a_2 d_1 & a_1 c_2 + a_2 d_2 \\ b_1 c_1 + b_2 d_1 & b_1 c_2 + b_2 d_2 \end{pmatrix}.
\]

We call it the product matrix, and designate it by PQ. (We 
omit the * just as in elementary algebra it is customary to 
omit the multiplication sign.) 
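
A short numerical check shows that the product matrix really does the work of the two mappings performed in succession. The sketch assumes matrices stored as pairs of rows; the helper names apply and multiply, and the sample point (2, 3), are choices made for this illustration.

```python
# Illustrative sketch: performing Q and then P agrees with the single matrix PQ.
def apply(matrix, point):
    (a1, a2), (b1, b2) = matrix
    x, y = point
    return (a1 * x + a2 * y, b1 * x + b2 * y)

def multiply(p, q):
    """Product matrix PQ, written out with the four formulas derived in the text."""
    (a1, a2), (b1, b2) = p
    (c1, c2), (d1, d2) = q
    return ((a1 * c1 + a2 * d1, a1 * c2 + a2 * d2),
            (b1 * c1 + b2 * d1, b1 * c2 + b2 * d2))

P = ((1, 1), (1, -1))
Q = ((2, 1), (1, -3))
point = (2, 3)

two_steps = apply(P, apply(Q, point))   # first Q, then P
one_step = apply(multiply(P, Q), point)  # the product matrix applied once
print(two_steps, one_step)
```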

Now, by comparing the matrix PQ with the matrices 
P and Q, we can observe a simple rule by which the multi- 
plication can be carried out without reference to the equa- 
tions of the mappings. Notice that in the product matrix 
PQ, the component in the first row and first column is 
a_1c_1 + a_2d_1. This expression can be obtained from the pairs 
(a_1, a_2) and (c_1, d_1) by first multiplying corresponding 
terms in the pairs, and then adding the products. The pair 
(a_1, a_2) is the first row of matrix P. The pair (c_1, d_1) is the 
first column of the matrix Q. So we get the component in 
the first row and first column of the product PQ by multi- 
plying corresponding components of the first row of P and 
the first column of Q, and adding the results. In order to be 
able to refer to it in fewer words, let us designate this kind 
of operation as multiplication of the first row of P by the 
first column of Q. We can now use this language to describe 
the procedure for getting all the components of the product 
matrix. To get the component in the first row and second 
column, multiply the first row of P by the second column 
of Q. To get the component in the second row and first 
column, multiply the second row of P by the first column 
of Q. To get the component in the second row and second 
column, multiply the second row of P by the second column 
of Q. In general, to get the component of the product 
matrix PQ that is in any particular row and column, multi- 
ply the corresponding row of P by the corresponding col- 
umn of Q. 

If we apply this rule for multiplying matrices to get the 
product QP, we find that 

\[
QP = \begin{pmatrix} c_1 & c_2 \\ d_1 & d_2 \end{pmatrix} \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} c_1 a_1 + c_2 b_1 & c_1 a_2 + c_2 b_2 \\ d_1 a_1 + d_2 b_1 & d_1 a_2 + d_2 b_2 \end{pmatrix}.
\]

Notice that this product is not the same as PQ, so matrix 
multiplication is not commutative. 
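
The row-by-column rule, and the failure of commutativity, can be seen with the matrices P and Q used earlier in the chapter. The function names below are assumptions made for this sketch.

```python
# Illustrative sketch of the row-by-column rule.
def row_times_column(row, column):
    """Multiply corresponding components and add the results."""
    return sum(r * c for r, c in zip(row, column))

def multiply(p, q):
    columns = list(zip(*q))  # the columns of q, each as a tuple
    return tuple(tuple(row_times_column(row, col) for col in columns)
                 for row in p)

P = ((1, 1), (1, -1))
Q = ((2, 1), (1, -3))

PQ = multiply(P, Q)
QP = multiply(Q, P)
print(PQ)
print(QP)
print(PQ == QP)
```

The two products share the same diagonal components here, but the other two components differ, so PQ and QP are different matrices.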

The Matrices Form a Ring 

Although matrix multiplication is not commutative, it is 
associative. We do not have to give a special proof of this 
fact, because we have already observed that the operation 
*, performed on linear mappings, is associative, and matrix 
multiplication is merely another way of expressing the same 
operation. It is also possible to show that matrix multipli- 
cation is distributive with respect to matrix addition. Con- 
sequently, the system of 2 by 2 matrices meets all the re- 
quirements for being a ring, just as the natural numbers, 
the integers, the rational numbers, and the real numbers 
did. However, unlike these other systems, it is not a com- 
mutative ring, since matrix multiplication is not commu- 
tative. 

There is a unity element for matrix multiplication. In 
fact, the unity element is the matrix which we have called I. 
Since multiplication is not, in general, commutative, we 
have to verify separately that I behaves as a unity element 
when it is used as a multiplier either from the left or from 
the right. 


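\[
IP = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix}
= \begin{pmatrix} 1a_1 + 0b_1 & 1a_2 + 0b_2 \\ 0a_1 + 1b_1 & 0a_2 + 1b_2 \end{pmatrix}
= \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} = P
\]

\[
PI = \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}
= \begin{pmatrix} a_1 \cdot 1 + a_2 \cdot 0 & a_1 \cdot 0 + a_2 \cdot 1 \\ b_1 \cdot 1 + b_2 \cdot 0 & b_1 \cdot 0 + b_2 \cdot 1 \end{pmatrix}
= \begin{pmatrix} a_1 & a_2 \\ b_1 & b_2 \end{pmatrix} = P
\]

Therefore IP = PI = P. 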

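It is natural to ask whether the ring of 2 by 2 matrices is 
also a field. We find that it is not a field, because it contains 
zero divisors, and a field may not have zero divisors, as we 
saw on page 87. Recall that zero divisors are non-zero 
elements that have a zero product. The matrices 
\[
\begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 0 & 3 \\ 0 & 0 \end{pmatrix}
\]
are different from the zero matrix, but their product 
is equal to the zero matrix: 

\[
\begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} 0 & 3 \\ 0 & 0 \end{pmatrix}
= \begin{pmatrix} 0 \cdot 0 + 2 \cdot 0 & 0 \cdot 3 + 2 \cdot 0 \\ 0 \cdot 0 + 0 \cdot 0 & 0 \cdot 3 + 0 \cdot 0 \end{pmatrix}
= \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}
\]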

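In fact, each of these matrices has the peculiar property 
that when it is multiplied by itself, the product is zero. If 
we use the letter T to represent the matrix 
\[
\begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix},
\]
then 

\[
T^2 = T \cdot T = \begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix}
= \begin{pmatrix} 0 \cdot 0 + 2 \cdot 0 & 0 \cdot 2 + 2 \cdot 0 \\ 0 \cdot 0 + 0 \cdot 0 & 0 \cdot 2 + 0 \cdot 0 \end{pmatrix}
= \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix} = O
\]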


An element like T that has the property that there is a power 
of it that is equal to zero is called a nilpotent element. In the 
rings we met before, the zero element itself was the only 
nilpotent element. However, there are many rings which, 
like the ring of 2 by 2 matrices, have nilpotent elements that 
are different from zero. 
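
Zero divisors and nilpotent elements are easy to test by machine. In the sketch below the function is_nilpotent and its cutoff max_power are assumptions made for illustration; the cutoff simply limits how many powers are tried.

```python
# Illustrative sketch: zero divisors and a nilpotent element among 2 by 2 matrices.
def multiply(p, q):
    columns = list(zip(*q))
    return tuple(tuple(sum(r * c for r, c in zip(row, col)) for col in columns)
                 for row in p)

def is_zero(m):
    return all(x == 0 for row in m for x in row)

def is_nilpotent(m, max_power=4):
    """Return True if some power of m, up to max_power, is the zero matrix."""
    power = m
    for _ in range(max_power):
        if is_zero(power):
            return True
        power = multiply(power, m)
    return False

T = ((0, 2), (0, 0))
S = ((0, 3), (0, 0))
I = ((1, 0), (0, 1))

print(is_zero(multiply(T, S)))  # two non-zero matrices with a zero product
print(is_nilpotent(T))          # T squared is already zero
print(is_nilpotent(I))          # every power of I is I itself
```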

The Matrices Form an Algebra 

The ring of 2 by 2 matrices has a double structure. With 
matrix addition and scalar multiplication, it has the struc- 
ture of a vector space. With matrix addition and matrix 
multiplication, it has the structure of a ring. A system that 
has such a double structure is known as an algebra. 
Roughly, an algebra may be described as a ring that also 
has a scalar multiplication defined for it, or a vector space 
that also has a binary operation of multiplication defined 
for it. The algebra we have just examined is only one of 
many possible algebras. In fact, by using definitions of addi- 
tion, scalar multiplication, and matrix multiplication anal- 
ogous to those given for 2 by 2 matrices, we can construct 
an algebra of 3 by 3 matrices, an algebra of 4 by 4 matrices, 
etc. 

Rectangular Matrices 

So far we have considered as matrices only square arrays 
of numbers. However, it is possible to extend the concept of 
a matrix to include rectangular arrays as well. If a rectangu- 
lar array of numbers has two rows and five columns, for 
example, it is known as a 2 by 5 matrix. A vector space can 
be constructed out of all 2 by 5 matrices by defining matrix 
addition and scalar multiplication in the same way that we 
did for square matrices. In general, we can give a vector 
space structure in this way to the system of all rectangular 
matrices that have any fixed number of rows and any fixed 
number of columns. From this point of view, the vectors we 
first described in the last chapter turn out to be special 
cases of matrices. An ordered pair like (2, 3) is nothing but 
a 1 by 2 matrix. An ordered triple is a 1 by 3 matrix, and so 
on. It is also possible to have a 2 by 1 matrix that has two 
rows and one column, or a 3 by 1 matrix that has three rows 
and one column. 

The definition of matrix multiplication can also be ex- 
tended to rectangular matrices, with a special restriction 
that arises from the way in which matrix multiplication is 
carried out. To multiply two matrices, we have to multiply 
each of the rows of the first matrix by each of the columns 
in the second matrix. This is possible only if a row of the 
first matrix has as many components as a column does in 
the second matrix. This means that multiplication of 
matrices is possible only when the number of columns in 
the first matrix is equal to the number of rows in the second 
matrix. For example, we can multiply a 2 by 3 matrix times 
a 3 by 4 matrix, but we cannot multiply a 3 by 4 matrix 
times a 2 by 3 matrix. 
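
The restriction on shapes can be made explicit in code. This is a sketch under the assumption that a matrix is a tuple of equally long rows; the names shape and can_multiply, and the sample matrices A and B, are invented for the example.

```python
# Illustrative sketch: multiplication is possible only when the number of
# columns in the first matrix equals the number of rows in the second.
def shape(m):
    return (len(m), len(m[0]))  # (rows, columns)

def can_multiply(first, second):
    return shape(first)[1] == shape(second)[0]

def multiply(first, second):
    if not can_multiply(first, second):
        raise ValueError("columns of the first must equal rows of the second")
    columns = list(zip(*second))
    return tuple(tuple(sum(r * c for r, c in zip(row, col)) for col in columns)
                 for row in first)

A = ((1, 2, 3), (4, 5, 6))                       # a 2 by 3 matrix
B = ((1, 0, 0, 1), (0, 1, 0, 1), (0, 0, 1, 1))   # a 3 by 4 matrix

print(shape(multiply(A, B)))  # the product of a 2 by 3 and a 3 by 4 matrix
print(can_multiply(B, A))     # a 3 by 4 matrix times a 2 by 3 matrix
```

In this sketch the product takes its number of rows from the first matrix and its number of columns from the second.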

Rectangular matrices give us a very powerful condensed 
language in which systems of several equations may some- 
times be expressed as a single simple matrix equation. For 
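example, consider these equations for a linear transforma- 
tion that maps the ordered triples (x_1, x_2, x_3) into the or- 
dered triples (y_1, y_2, y_3): 

\[
\begin{aligned}
y_1 &= a_1 x_1 + a_2 x_2 + a_3 x_3 \\
y_2 &= b_1 x_1 + b_2 x_2 + b_3 x_3 \\
y_3 &= c_1 x_1 + c_2 x_2 + c_3 x_3
\end{aligned}
\]

These three equations are equivalent to the single matrix 
equation: 

\[
\begin{pmatrix} y_1 \\ y_2 \\ y_3 \end{pmatrix}
= \begin{pmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}
\]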

as you can verify by carrying out the matrix multiplication. 
If we introduce names for the matrices, as follows, 



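\[
Y = \begin{pmatrix} y_1 \\ y_2 \\ y_3 \end{pmatrix}, \quad
P = \begin{pmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{pmatrix}, \quad
X = \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix},
\]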


then the equation takes the particularly simple form, Y = 
PX. Working with this single equation according to the 
rules of matrix algebra then takes the place of working with 
the three original equations. 
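
As a small illustration of Y = PX, the sketch below multiplies a 3 by 3 matrix by a 3 by 1 matrix and compares the result with the three separate equations. The numerical coefficients are made up for the example.

```python
# Illustrative sketch: three linear equations written as one matrix equation.
def multiply(first, second):
    columns = list(zip(*second))
    return tuple(tuple(sum(r * c for r, c in zip(row, col)) for col in columns)
                 for row in first)

# Made-up coefficients for the example.
P = ((1, 2, 3),
     (0, 1, 4),
     (5, 6, 0))
X = ((1,), (2,), (3,))  # a 3 by 1 matrix holding x1, x2, x3

Y = multiply(P, X)

# The same three values computed from the separate equations.
x1, x2, x3 = 1, 2, 3
y1 = 1 * x1 + 2 * x2 + 3 * x3
y2 = 0 * x1 + 1 * x2 + 4 * x3
y3 = 5 * x1 + 6 * x2 + 0 * x3

print(Y)
print((y1, y2, y3))
```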

Matrix algebra, one of the youngest branches of mathe- 
matics, is now one of the most widely used. Besides being an 
indispensable tool in higher mathematics, it is also em- 
ployed in such diverse fields as psychology, chemistry, 
physics, economics, and electrical engineering. 

DO IT YOURSELF 

1. Find the sums P + Q and Q + P, and the products PQ 
and QP of the following 2 by 2 matrices: 


P = 


Q = 


Compare your answers to verify that P + Q = Q -p P, 
but PQ is not equal to QP. 

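2. a) Find the scalar product 
\[
2 \begin{pmatrix} a & b \\ c & d \end{pmatrix}
\]

b) Find the matrix product 
\[
\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix} \begin{pmatrix} a & b \\ c & d \end{pmatrix}
\]

Compare the results of both multiplications. Notice that 
the matrix 
\[
\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix},
\]
used with matrix multiplication, behaves like the scalar 2, 
used with scalar multiplication. 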


3. Let P = 


Q = 


-1 

4 


R = 


2 -1 

6 5 


a) Find P(Q + R). That is, add Q and R, and then 
multiply by P from the left. 

b) Find PQ + PR. That is, find the products PQ and 
PR, and then add them. 

c) Compare the results of a) and b) to verify that the 
distributive law is obeyed. 


4. Let 
\[
T = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix}
\]

a) Find T² = TT. 

b) Find T³ = T²T. What kind of element is T? (See 
page 156.) 


CHAPTER VIII 


Arrows That Are Numbers 


NOW WE return to unfinished business. In Chapter 
VI we had set out to accomplish a double purpose. We 
wanted to construct an extension of the real number system 
that would supply a number for every point in a plane, 
and would at the same time contain a number that satisfies 
the equation x² = -1. As the first step toward accomplishing 
this purpose we constructed a system in which each 
element is an ordered pair of real numbers, like (1, 4), or 
(— 1 \/2, i?r). We defined an operation of addition for these 
elements, and an operation we called scalar multiplication. 
With these two operations the system became an example 
of the type of structure we called a vector space. 

In this vector space, we have an element for every point 
in the plane. However, this does not mean that we have 
already accomplished our purpose. Our goal was to find a 
number system that supplies an element for every point in 
the plane. The vector space we have constructed does not 
yet qualify as a number system. A structure is entitled to 
be called a number system only if it has two binary opera- 
tions called addition and multiplication, such that each of 
these operations is associative and commutative, and mul- 
tiplication is distributive with respect to addition. In the 
vector space whose elements are ordered pairs, we have an 
addition operation that is associative and commutative. 
We also have the operation called scalar multiplication, but 
it does not help the system qualify as a number system, be- 
cause scalar multiplication is not a binary operation. In 
scalar multiplication, we multiply a vector by an element 
that is outside the system of vectors. In a multiplication 
that is a binary operation, we would have to multiply a 
vector by a vector, and get a product that is also a vector. So, 
to complete our construction, we now define a multiplica- 
tion of this kind. 

Ordered Pairs Become Numbers 

We define the multiplication of ordered pairs by the following 
equation: (a, b) • (c, d) = (ac - bd, ad + bc). (In 
this equation, ac means a • c, in accordance with the custom 
of sometimes omitting the multiplication sign in indicated 
products.) With this definition, for example, the product 
(2, 3) • (4, 1) would be equal to (2 • 4 — 3 • 1, 2 • 1 + 
3 • 4) = (5, 14). Now we show that this operation is com- 
mutative and associative, and is distributive with respect 
to addition. 
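
The definition can be written as a two-line function. The name multiply_pairs is an assumption made for this sketch; the comparison with Python's built-in complex type is included only as a cross-check.

```python
# Illustrative sketch of the multiplication rule for ordered pairs.
def multiply_pairs(p, q):
    """(a, b) . (c, d) = (ac - bd, ad + bc)"""
    a, b = p
    c, d = q
    return (a * c - b * d, a * d + b * c)

product = multiply_pairs((2, 3), (4, 1))
print(product)

# Cross-check against Python's built-in complex numbers.
builtin = complex(2, 3) * complex(4, 1)
print(builtin)
```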

To show that multiplication of ordered pairs is commuta- 
tive, we compare (a, b) • (c, d) with (c, d) • (a, b). 

(a, b) • (c, d) = (ac - bd, ad + bc) 

(c, d) • (a, b) = (ca - db, cb + da). 

But, in the real number system, ac - bd = ca - db, and 
ad + bc = cb + da. So the two products are the same, and 
the commutative law is obeyed. 

To show that multiplication of ordered pairs is associative, 
we compare [(a, b) • (c, d)] • (e, f) with (a, b) • 
[(c, d) • (e, f)]. 

[(a, b) • (c, d)] • (e, f) = (ac - bd, ad + bc) • (e, f) 

= ([ac - bd]e - [ad + bc]f, [ac - bd]f + [ad + bc]e) 

= (ace - bde - adf - bcf, acf - bdf + ade + bce). 

(a, b) • [(c, d) • (e, f)] = (a, b) • (ce - df, cf + de) 

= (a[ce - df] - b[cf + de], a[cf + de] + b[ce - df]) 

= (ace - adf - bcf - bde, acf + ade + bce - bdf). 

The first component in each of these products is the sum of 
four terms. Notice that they are the same four terms, merely 
written in a different order. Therefore these components 
are equal. Similarly, the second components are equal, and 
therefore the two products are equal. So multiplication of 
ordered pairs obeys the associative law. 

To show that multiplication of ordered pairs is distributive 
with respect to addition, we compare (a, b) • [(c, d) + 
(e, f)] with (a, b) • (c, d) + (a, b) • (e, f). 

(a, b) • [(c, d) + (e, f)] = (a, b) • (c + e, d + f) 

= (a[c + e] - b[d + f], a[d + f] + b[c + e]) 

= (ac + ae - bd - bf, ad + af + bc + be). 

(a, b) • (c, d) + (a, b) • (e, f) 

= (ac - bd, ad + bc) + (ae - bf, af + be) 

= ([ac - bd] + [ae - bf], [ad + bc] + [af + be]) 

= (ac - bd + ae - bf, ad + bc + af + be). 

The comparison shows that both expressions lead to the 
same result, and the distributive law is obeyed. 
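
The three laws just proved can be spot-checked on a small grid of integer pairs. This is a numerical illustration only, not a substitute for the algebraic proofs; the helper names and the range of test values are assumptions made for the sketch.

```python
# Illustrative spot check of the commutative, associative, and distributive laws.
from itertools import product

def add_pairs(p, q):
    return (p[0] + q[0], p[1] + q[1])

def multiply_pairs(p, q):
    a, b = p
    c, d = q
    return (a * c - b * d, a * d + b * c)

pairs = list(product(range(-2, 3), repeat=2))  # 25 ordered pairs of small integers

commutative = all(multiply_pairs(p, q) == multiply_pairs(q, p)
                  for p, q in product(pairs, repeat=2))
associative = all(
    multiply_pairs(multiply_pairs(p, q), r) == multiply_pairs(p, multiply_pairs(q, r))
    for p, q, r in product(pairs, repeat=3))
distributive = all(
    multiply_pairs(p, add_pairs(q, r))
    == add_pairs(multiply_pairs(p, q), multiply_pairs(p, r))
    for p, q, r in product(pairs, repeat=3))

print(commutative, associative, distributive)
```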

With the addition defined in Chapter VI, and with this 
new kind of multiplication, the system of ordered pairs of 
real numbers obeys the five laws that are characteristic of 
a number system. So now, at last, we have a number system 
that supplies a number for every point in a plane. We saw 
in Chapter VI that each ordered pair can be represented 
pictorially by an arrow, and the operations defined for 
ordered pairs may be interpreted as operations with the 
arrows. So we have, in effect, converted the system of arrows 
into a number system. We call it the system of complex 
numbers. 

We have already observed, in Chapter VI, that the sys- 
tem of ordered pairs is an abelian group with respect to 
addition. Now that we have for this system a binary opera- 
tion of multiplication that is associative and obeys the dis- 
tributive law, the system meets the requirements for being 
a ring. But we already know that it is a vector space, 
equipped with a scalar multiplication. So the system of 
complex numbers has a double structure and therefore 
qualifies as an algebra in the sense of the definition given 
on page 156. 


The Complex Number System Is a Field 

To qualify as a field as well as a ring, the complex number 
system must meet the requirements of having a unity ele- 
ment, and having a reciprocal for every element except the 
zero element. We observe first that the complex number 
(1, 0) is a unity element for the system, because when it 
multiplies any number in the system, it leaves that number 
unchanged: 

(1, 0) • (a, b) = (1 • a - 0 • b, 1 • b + 0 • a) = (a, b). 

Now we shall produce a reciprocal for every complex 
number that is different from the zero element. The zero 
element in the complex number system is (0, 0). So a complex 
number (a, b) is the zero element only if a = 0 and b = 0. 
If a complex number is different from the zero element, then 
a and b are not both zero. In that case, a² + b² is not zero, 
so that division by a² + b² is possible. So, if (a, b) is not the 
zero element, the symbols a/(a² + b²) and -b/(a² + b²) represent 
actual real numbers, and the ordered pair 
(a/(a² + b²), -b/(a² + b²)) is a complex number. We now show 
that it is the reciprocal of (a, b) by showing that their 
product is equal to the unity element: 

(a, b) • (a/(a² + b²), -b/(a² + b²)) 

= (a²/(a² + b²) + b²/(a² + b²), -ab/(a² + b²) + ab/(a² + b²)) 

= (1, 0). 

For example, the reciprocal of (3, 4) is (3/25, -4/25). 

This fact can be verified separately by multiplication: 

(3, 4) • (3/25, -4/25) = (3 • [3/25] - 4[-4/25], 3[-4/25] + 4[3/25]) 

= (9/25 + 16/25, -12/25 + 12/25) = (1, 0). 
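
The reciprocal formula can be tried out with exact fractions. The sketch below assumes integer components so that Python's Fraction type can be used without rounding; the function names are chosen for this illustration.

```python
# Illustrative sketch: the reciprocal of a non-zero ordered pair, in exact arithmetic.
from fractions import Fraction

def multiply_pairs(p, q):
    a, b = p
    c, d = q
    return (a * c - b * d, a * d + b * c)

def reciprocal(p):
    """Return (a/(a^2 + b^2), -b/(a^2 + b^2)) for integer a and b, not both zero."""
    a, b = p
    n = a * a + b * b
    if n == 0:
        raise ZeroDivisionError("the zero element has no reciprocal")
    return (Fraction(a, n), Fraction(-b, n))

r = reciprocal((3, 4))
print(r)
print(multiply_pairs((3, 4), r))  # should be the unity element
```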

We Still Have the Real Numbers 

When we constructed the system of integers, we showed 
that there is a subset of the integers, the non-negative inte- 
gers, that is isomorphic to the system of whole numbers, and 
therefore can take its place for all practical purposes. In this 
sense, the system of integers includes the system of whole 
numbers, and is therefore an extension of the system of 
whole numbers. In the same way we showed that the ra- 
tional number system includes a subset isomorphic to the 
integers, and the real number system includes a subset iso- 
morphic to the rational numbers. Now we show that the 
complex number system includes a subset isomorphic to the 
real numbers, and is therefore an extension of the real num- 
ber system. 

Before we display this subset, let us first introduce a 
new notation analogous to the i, j, k notation used for three 
dimensional vectors in Chapter VI. The complex number 
system is a two dimensional vector space, and all of its 
elements can be expressed in terms of the two unit vectors 
(1, 0) and (0, 1). We have already seen that (1, 0) is the 
unity element of the system, so let us call it u, to remind 
us of that fact. We assign the name i to the vector (0, 1). 
Then, the complex number (a, b) = (a, 0) + (0, b) = a • 
(1, 0) + b • (0, 1) = au + bi. These equalities follow from 
the rules for vector addition and scalar multiplication. Now 
we shall show that the real number system is isomorphic to 
the subset of the complex number system consisting of 
those ordered pairs in which the second component is 0. All 
the numbers in this subset have the special form (a, 0) = 
a • (1, 0) = au. To prove the isomorphism we have to produce 
a one-to-one correspondence in which the image of a 
sum is the sum of the images, and the image of a product 
is the product of the images. 

The one-to-one correspondence we use is the mapping 
a ↔ au. That is, we associate with each real number a the 
complex number that is equal to a times the unity element 
u. Under this mapping, the image of 1 is 1u, the image of 
5 is 5u, the image of ½ is ½u, the image of √3 is √3u, the 
image of -2 is -2u. The image of 0 is 0u = 0(1, 0) = (0, 
0) = the zero element of the complex number system. The 
image of a + b is (a + b)u, and the image of ab is (ab)u. 

Now let us compare the image of a sum with the sum of 
the images. The image of a is au. The image of b is bu. The 
sum of the images is au + bu = a(1, 0) + b(1, 0) = (a, 
0) + (b, 0) = (a + b, 0) = (a + b)(1, 0) = (a + b)u. But 
this is precisely the image of the sum a + b under the map- 
ping. So the image of a sum is the sum of the images, and 
the mapping preserves addition. 

Now let us compare the image of a product with the 
product of the images. The product of the images is (au) • 
(bu) = [a(1, 0)] • [b(1, 0)] = (a, 0) • (b, 0) = (ab - 0 • 
0, a • 0 + b • 0) = (ab, 0) = (ab)(1, 0) = (ab)u. But this 
is precisely the image of the product ab. So the image of 
a product is the product of the images, and the mapping 
preserves multiplication. Therefore, in so far as the binary 
operations of addition and multiplication are concerned, 
the complex number au behaves just like the real number 
a, and, in particular, the unity element u behaves just 
like the real number 1. However, we have one more com- 
parison to make, before we can agree that real numbers 
and complex numbers of the form au are interchangeable. 
The real number system does a special job in relation to the 
complex number system when it serves as the field of scalars 
for scalar multiplication. We have to check whether com- 
plex numbers of the form au can serve as scalars, too. To 
answer this question, we multiply any complex number 
(c, d) by the scalar a, using scalar multiplication, of course. 
Then we multiply the same complex number (c, d) by the 
complex image of a, namely au. Since both (c, d) and au 
are complex numbers, this multiplication will be the binary 
operation defined for the complex number system. Then we 
compare the results. 

Scalar multiplication: a(c, d) = (ac, ad). 

Complex number multiplication: 

(au) • (c, d) = [a(1, 0)] • (c, d) = (a, 0) • (c, d) 

= (ac - 0 • d, ad + 0 • c) = (ac, ad). 

The products are the same! Multiplication by the com- 
plex number au produces the same result as scalar mul- 
tiplication by the real number a. So the complex numbers 
of the form au and the real numbers are completely 
equivalent, and the complex number system is an exten- 
sion of the real number system. We can now dispense with 
the special symbol u, and replace it by 1, the symbol for 
the real number that it is equivalent to. Similarly, we write 
a for the complex number au. So now, the complex number 
(a, b), which we have written as au + bi, may be written 
as a + bi. The a is referred to as its real part and the 
bi is referred to as its imaginary part, and the complex number 
i is called the imaginary unit. In this notation, a real 
number is a complex number a + bi whose imaginary part 
has b = 0. 
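
The three comparisons made in this section can be repeated numerically. The sketch below uses a small range of integers as stand-ins for real numbers; the function name embed (for the mapping that sends a to au) and the other helper names are assumptions made for illustration.

```python
# Illustrative spot check that a -> (a, 0) preserves sums, products,
# and scalar multiplication.
from itertools import product

def add_pairs(p, q):
    return (p[0] + q[0], p[1] + q[1])

def multiply_pairs(p, q):
    a, b = p
    c, d = q
    return (a * c - b * d, a * d + b * c)

def scale(a, p):
    return (a * p[0], a * p[1])

def embed(a):
    """The image au of the real number a, written as an ordered pair."""
    return (a, 0)

values = range(-3, 4)

sums_ok = all(embed(a + b) == add_pairs(embed(a), embed(b))
              for a, b in product(values, repeat=2))
products_ok = all(embed(a * b) == multiply_pairs(embed(a), embed(b))
                  for a, b in product(values, repeat=2))
scalars_ok = all(scale(a, (c, d)) == multiply_pairs(embed(a), (c, d))
                 for a, c, d in product(values, repeat=3))

print(sums_ok, products_ok, scalars_ok)
```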

The fact that scalar multiplication by the real number a 
is equivalent to multiplication by the complex number au 
(now also designated by a) makes it unnecessary to retain 
scalar multiplication as a separate operation for the algebra 
of complex numbers. Any time we need scalar multiplica- 
tion, we merely think of it as the special case of the multi- 
plication of complex numbers that arises when one of the 
multipliers happens to be a real number. 

The Question Is Answered 

One of our purposes in constructing the system of complex 
numbers was to have a system in which the equation 
x² = -1 has a solution. We can now show that this purpose 
has been achieved. First let us be sure that we can recognize 
-1 in this system when we see it. The number -1, as 
an element in the complex number system, is equivalent 
to what we called -1u before, which is -1(1, 0), or (-1, 
0). It is in this form that we shall encounter it. The equation 
that we are trying to solve asks the question, “What 
number multiplied by itself gives —1 as the product?” We 
now show that the imaginary unit i provides the answer to 
the question. 

i • i = (0, 1) • (0, 1) = (0 • 0 - 1 • 1, 0 • 1 + 1 • 0) 

= (-1, 0) = -1. 

Now that we have established the fact that i² = i • i = 
-1, we can discard the cumbersome apparatus of ordered 
pairs, and work with complex numbers in a particularly easy 
way. We write them in the form a + bi, as we have already 
done, and we add them and multiply them according 
to the rules of elementary high school algebra, making use 
of the special rule that i² = -1. How addition and multiplication 
of complex numbers are carried out in this notation 
is shown in the following examples: 

To add (2 + 3 i) -+- ( — 5 + 6i) : 

2 -p 3i 

— 5 + 6i 

— 3 — (— 9 1 


To multiply (2 + 3i)(-5 + 6i):

       2 +  3i
      -5 +  6i
    ----------
     -10 - 15i
         + 12i + 18i^2
    ------------------
     -10 -  3i + 18(-1)

     = -28 - 3i
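
The two worked examples can be checked with any language that has complex numbers built in. The sketch below uses Python, which writes the imaginary unit as j instead of i; the variable names are chosen only for this illustration.

```python
# Python's built-in complex type; 3j means 3i in the notation of the text.
z = 2 + 3j
w = -5 + 6j

total = z + w      # (2 + 3i) + (-5 + 6i)
product = z * w    # (2 + 3i)(-5 + 6i)

print(total)
print(product)
```

The results agree with the hand computation: the sum is -3 + 9i and the product is -28 - 3i.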

Bad Names for Good Numbers

Complex numbers were first introduced by the Italian 
mathematicians of the sixteenth century, who found, as we 
have, that they could not solve certain equations without 
them. However, mathematicians in those days were ac- 
customed to think in terms of only one number system of 
positive and negative numbers. They considered the num- 
bers in this system genuine, and therefore called them 
“real.” The complex numbers clearly did not belong to the 
“real” number system, so, although they used them as a 
convenience, they considered them to be spurious, or unreal, 
and called them “imaginary.” In fact, the great seventeenth-
century mathematician René Descartes was so doubtful
of the reality of these numbers that he rejected them altogether.
Today we realize that there is not only one number
system. There are many number systems. All are equally 
genuine, although they differ from each other. Nevertheless 
we still use the old names, “real” numbers and “imaginary” 
numbers. So we must be careful not to be influenced by the 
old prejudices that are expressed in these names. When we 
use the term “imaginary numbers” now, we must bear in 
mind that it is a technical term, and should not be in- 
terpreted as a derogatory epithet casting doubt on the genu- 
ineness of the numbers. 

Imaginary numbers are genuine numbers. We have dem- 
onstrated their existence by constructing them in the form 
of ordered pairs of real numbers. However, in order to dispel 
any lingering doubts that may remain about their genuine- 
ness, we now proceed to construct them by two other 
methods. After constructing complex numbers in three different
ways, we ought to feel certain that they are genuine,
though not “real.”

Complex Numbers as Matrices 

The second construction of the complex number system
requires no new act of creation on our part. We find that
the complex numbers are actually hidden in one of the structures
we built up before. They form a subset of the algebra of
2 by 2 matrices defined in Chapter VII. In fact, the complex
numbers are nothing else but the 2 by 2 matrices of the
form

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix},$$

in which the components in the upper left
hand corner and the lower right hand corner are the same
real number, and the other two components have the
property that each is the negative of the other. Examples
of this type of matrix are:

$$\begin{pmatrix} 2 & 3 \\ -3 & 2 \end{pmatrix}, \quad
\begin{pmatrix} -6 & -5 \\ 5 & -6 \end{pmatrix}, \quad
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}.$$

It may seem strange for this to be true, in view of the 
fact that multiplication of matrices is not commutative, 
while multiplication of complex numbers is commutative. 
However, while multiplication may not be commutative for 
a system as a whole, it may be commutative for some special 
subset in the system. In any case, we can easily prove that 
multiplication is commutative for the 2 by 2 matrices that 
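have this special form. We compare

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix} \begin{pmatrix} c & d \\ -d & c \end{pmatrix}
\quad \text{with} \quad
\begin{pmatrix} c & d \\ -d & c \end{pmatrix} \begin{pmatrix} a & b \\ -b & a \end{pmatrix}$$

and show that the products are the same.

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix} \begin{pmatrix} c & d \\ -d & c \end{pmatrix}
= \begin{pmatrix} a(c) + b(-d) & a(d) + b(c) \\ -b(c) + a(-d) & -b(d) + a(c) \end{pmatrix}
= \begin{pmatrix} ac - bd & ad + bc \\ -bc - ad & -bd + ac \end{pmatrix}$$

$$\begin{pmatrix} c & d \\ -d & c \end{pmatrix} \begin{pmatrix} a & b \\ -b & a \end{pmatrix}
= \begin{pmatrix} c(a) + d(-b) & c(b) + d(a) \\ -d(a) + c(-b) & -d(b) + c(a) \end{pmatrix}
= \begin{pmatrix} ac - bd & bc + ad \\ -ad - bc & -bd + ac \end{pmatrix}$$

An examination of the two products, component by component,
shows that they are the same.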

Actually, it is not necessary to give this special proof 
that multiplication of these matrices is commutative. All 
we really have to do is show that the system of these special 
matrices is isomorphic with the complex number system we 
already have. Once we have established this fact, then it 
follows that the system of special matrices has all the 
properties that the complex number system has, including 
the commutativity of multiplication. 

To prove that the two systems are isomorphic, we make
use of the mapping

$$a + bi = (a, b) \leftrightarrow \begin{pmatrix} a & b \\ -b & a \end{pmatrix}.$$

Under this mapping, (0, 0) is associated with
$\begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix}$,
(1, 0) is associated with
$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$,
and (0, 1) is associated with
$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$.
This mapping is clearly a one-to-one correspondence.
Now we show that it preserves addition and multiplication.
The ordered pairs (a, b) and (c, d) have the sum
(a + c, b + d). Their images in the system of matrices are

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} c & d \\ -d & c \end{pmatrix}.$$

These images have the matrix sum

$$\begin{pmatrix} a + c & b + d \\ -b - d & a + c \end{pmatrix},$$

which is the image of the ordered pair
(a + c, b + d) under the mapping. So the sum of the images
is the image of the sum.

The ordered pairs (a, b) and (c, d) have the product
(ac - bd, ad + bc). Their images,

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} c & d \\ -d & c \end{pmatrix},$$

have the matrix product

$$\begin{pmatrix} ac - bd & ad + bc \\ -bc - ad & -bd + ac \end{pmatrix},$$

as shown on page 169. But this product is the image of the
ordered pair (ac - bd, ad + bc) under the mapping. So the
product of the images is the image of the product. Therefore
this system of special 2 by 2 matrices is isomorphic to the
complex number system. It is nothing but the complex
number system masquerading in a different style of dress.
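
The argument can be spot-checked numerically. The sketch below is only an illustration for one pair of sample values, not a proof; the helper names to_matrix, mat_add, mat_mul, and pair_mul are assumptions made for this example.

```python
def to_matrix(a, b):
    """Image of a + bi = (a, b) under the mapping: the matrix [[a, b], [-b, a]]."""
    return ((a, b), (-b, a))


def mat_add(m, n):
    """Ordinary addition of 2 by 2 matrices."""
    return tuple(tuple(m[r][c] + n[r][c] for c in range(2)) for r in range(2))


def mat_mul(m, n):
    """Ordinary multiplication of 2 by 2 matrices."""
    return tuple(
        tuple(sum(m[r][k] * n[k][c] for k in range(2)) for c in range(2))
        for r in range(2)
    )


def pair_mul(p, q):
    """(a, b) . (c, d) = (ac - bd, ad + bc)."""
    (a, b), (c, d) = p, q
    return (a * c - b * d, a * d + b * c)


p, q = (2, 3), (-5, 6)

image_of_product = to_matrix(*pair_mul(p, q))
product_of_images = mat_mul(to_matrix(*p), to_matrix(*q))

print(image_of_product == product_of_images)
# Multiplication of these special matrices also commutes:
print(mat_mul(to_matrix(*q), to_matrix(*p)) == product_of_images)
```

For the sample values both comparisons hold, as the general proof says they must.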


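In this representation, the number 1 appears as
$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$.
The real numbers have the form
$\begin{pmatrix} a & 0 \\ 0 & a \end{pmatrix}$.
The numbers bi, whose real part is zero, have the form
$\begin{pmatrix} 0 & b \\ -b & 0 \end{pmatrix}$.
Consequently, the number i is
$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$,
and the number -1 is
$\begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}$.

We can verify directly by matrix multiplication that the
number i in this form satisfies the equation $x^2 = -1$:

$$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}
= \begin{pmatrix} 0 \cdot 0 + 1(-1) & 0 \cdot 1 + 1 \cdot 0 \\ -1 \cdot 0 + 0(-1) & -1 \cdot 1 + 0 \cdot 0 \end{pmatrix}
= \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -1.$$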


Complex Numbers as Residue Classes 

The third method of constructing the complex number 
system follows a procedure we have already used before. 
In Chapter III we divided the system of integers into resi- 
due classes modulo 3, by putting together into the same 
class all integers that have the same remainder when we 
divide them by 3. We defined operations of addition and 
multiplication for these residue classes. Then we found 
that, with these operations, the residue classes formed a 
number system. In Chapter IV we found out that the 
system of residue classes modulo 3 is also a field. We are now
going to go through these steps in the same order. First, we
shall construct a ring that will play the same part here that 
the ring of integers did in Chapter III. Then we shall divide 
it into residue classes, by putting into the same class all 
members of the ring that have the same remainder when we 
divide by a particular member of the ring specially chosen 
for our purpose. Then, with addition and multiplication of
residue classes defined as they were in Chapter III, we shall find
that the system of residue classes is a number system, and a
field. Finally, we shall show that this number system is 
nothing but the complex number system in disguise. 

The elements that we use for the ring that we start with
are the polynomials whose terms are powers of x with coefficients
that are real numbers. In this system, each separate
real number, like 2 or $\sqrt{3}$, is a polynomial of zero
degree. Then there are polynomials of the first degree, like
2x + 5, and $\sqrt{3}x - 7$. There are polynomials of the second
degree, like $x^2 - 1$, $x^2 + 1$, and polynomials of higher degree
like $2x^3 - 5x^2 + 6x + 7$, $x^5 - 4x + 1$, etc. On page 134, we
saw that this system of polynomials is a vector space. The
addition in this vector space is the ordinary addition of
polynomials we learned in high school algebra. The polynomials
form an abelian group with respect to this operation
of addition. The zero element for this operation is the
real number 0, considered as a polynomial of no degree.
Now we introduce an operation of multiplication of polynomials
by using the rules of multiplication taught in elementary
high school algebra. For example, we obtain the
product (2x + 1)(3x - 5) in this way:

          3x - 5
          2x + 1
    ------------
    6x^2 - 10x
         +  3x - 5
    --------------
    6x^2 -  7x - 5

This operation of multiplication of polynomials is associative
and commutative, and it is distributive with respect to
addition. So the system of polynomials has the structure of
a ring. The unity element for multiplication of polynomials 
is the real number 1 considered as a polynomial of zero 
degree. 

The next step we take is influenced by the purpose we
are trying to accomplish. Our goal, you will recall, is to
construct a number system in which the equation $x^2 + 1 = 0$
has a solution. The polynomial that appears in this equation
plays a special part in the next step of our construction.
We use it as a divisor, and divide it into every other polynomial
in the system. Where the division comes out even,
we get a quotient, and a remainder equal to zero. Where the
division does not come out even, the remainder is different
from zero. For example, when we divide $x^4 + 3x^2 + 2$ by
$x^2 + 1$, the quotient is $x^2 + 2$, and the remainder is 0. When
we divide $x^2 + 1$ by $x^2 + 1$, the quotient is 1, and the remainder
is 0. When we divide 0 by $x^2 + 1$, the quotient is 0,
and the remainder is 0. When we divide $x^2 + 2$ by $x^2 + 1$,
the quotient is 1 and the remainder is 1. When we divide x
by $x^2 + 1$, the quotient is 0, and the remainder is x. When
we divide $x^2 + 3x + 5$ by $x^2 + 1$, the quotient is 1, and the
remainder is 3x + 4.

In arithmetic, when we divide one integer by another, the
remainder is always smaller than the divisor. There is an
analogous rule for the division of polynomials. When we
divide one polynomial by another, the remainder is always
a polynomial of lower degree than the divisor. This is so
because, as long as the remainder has the same degree as the
divisor, or a higher degree, the division can be carried out
for at least one more step. This is seen in the succession
of steps for dividing $2x^3 - 3x^2 + 4x - 1$ by $x^2 + 1$:

                    2x - 3
              ----------------------
    x^2 + 1 ) 2x^3 - 3x^2 + 4x - 1
              2x^3        + 2x
              ----------------------
                   - 3x^2 + 2x - 1
                   - 3x^2      - 3
                   -----------------
                            2x + 2

The remainder in this case is 2x + 2. Since the divisor
$x^2 + 1$ is of the second degree, all remainders we get will be
of the first degree or the zero degree. They will therefore be
in the form a + bx, where a and b are real numbers. The
remainder is of zero degree in the cases where b happens to
be 0.
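
The long division above can be sketched in code. The following Python function is an illustrative helper, not something from the text: it assumes a polynomial is stored as a list of coefficients with the constant term first, so that [-1, 4, -3, 2] stands for 2x^3 - 3x^2 + 4x - 1.

```python
def divide_by_x2_plus_1(coeffs):
    """Divide a polynomial by x^2 + 1.

    coeffs[k] is the coefficient of x^k. Returns (quotient, remainder) in the
    same format; the remainder is always [a, b], meaning a + bx.
    """
    rem = list(coeffs)
    while len(rem) < 2:          # make sure the a and b slots exist
        rem.append(0)
    quotient = [0] * (len(rem) - 2)
    # Work down from the highest power, exactly as in long division.
    for k in range(len(rem) - 1, 1, -1):
        q = rem[k]               # coefficient of x^k still to be removed
        quotient[k - 2] = q      # q * x^(k-2) goes into the quotient
        rem[k] -= q              # subtract q * x^(k-2) * (x^2 + 1),
        rem[k - 2] -= q          # that is, q * x^k + q * x^(k-2)
    return quotient, rem[:2]


quotient, remainder = divide_by_x2_plus_1([-1, 4, -3, 2])
print(quotient, remainder)
```

For the polynomial divided above, the quotient comes out as [-3, 2], that is 2x - 3, and the remainder as [2, 2], that is 2x + 2, matching the hand computation.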

Now we sort all the polynomials in the ring into residue
classes, by putting together in the same class all polynomials
that have the same remainder when they are divided
by $x^2 + 1$. We shall designate each class by a symbol
of the form $C_P$, where P is a polynomial that belongs to the
class. Thus each class has many names, and we recognize
that two names stand for the same class when we see that
the polynomials written as subscripts have the same remainder
when we divide by $x^2 + 1$. For example, $C_0$, $C_{x^2+1}$,
and $C_{x^4+3x^2+2}$ all stand for the same class, because
the polynomials 0, $x^2 + 1$, and $x^4 + 3x^2 + 2$ all have the remainder
0 when they are divided by $x^2 + 1$. The polynomial
x belongs to the class $C_x$. The polynomial a, where a is a real
number, belongs to the class $C_a$. Since different real numbers
have different remainders when they are divided by $x^2 + 1$
(each is its own remainder), no two real numbers belong to
the same class.

We define operations of addition and multiplication of
residue classes in the same way that we did for residue
classes modulo 3 in Chapter III. To add two residue classes,
pick a polynomial from each class, and add them. Then
identify the class to which the sum of the polynomials belongs.
That class will be the sum of the two classes. In the
notation we are using, a member of each class is always put
on display as a subscript in the name of the class. So we
can add two classes by merely adding their subscripts. For
example, $C_2 + C_3 = C_5$. $C_{2x} + C_7 = C_{2x+7}$.

To multiply two residue classes, we multiply a polynomial
in one class by a polynomial in the other, and find
the class that the product belongs to. Since a member of
each class is put on display as a subscript in the name of the
class, we can multiply two classes by merely multiplying
their subscripts.
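
These two rules can be sketched in code. In the Python illustration below, a class is named by any polynomial in it, written as a list of coefficients with the constant term first, and the result is reported as the pair (a, b) for the remainder a + bx. The function names and the list representation are assumptions of this sketch, not notation from the text.

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (constant term first)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, pc in enumerate(p):
        for j, qc in enumerate(q):
            out[i + j] += pc * qc
    return out


def remainder_mod_x2_plus_1(p):
    """Remainder a + bx of p on division by x^2 + 1, returned as (a, b)."""
    a = b = 0
    for k, c in enumerate(p):
        # x^2 leaves the remainder -1, so the powers 1, x, x^2, x^3, x^4, ...
        # leave the remainders 1, x, -1, -x, 1, ... in turn.
        sign = -1 if (k // 2) % 2 else 1
        if k % 2 == 0:
            a += sign * c
        else:
            b += sign * c
    return (a, b)


def class_add(p, q):
    """Add the classes named by p and q: add the subscripts, then reduce."""
    n = max(len(p), len(q))
    total = [(p[k] if k < len(p) else 0) + (q[k] if k < len(q) else 0)
             for k in range(n)]
    return remainder_mod_x2_plus_1(total)


def class_mul(p, q):
    """Multiply the classes named by p and q: multiply the subscripts, then reduce."""
    return remainder_mod_x2_plus_1(poly_mul(p, q))


print(class_add([2], [3]))        # the class of 2 plus the class of 3
print(class_add([0, 2], [7]))     # the class of 2x plus the class of 7
print(class_mul([0, 1], [0, 1]))  # the class of x times the class of x
```

Reporting every class by its remainder gives each class a single standard name, which makes it easy to see when two differently named classes are the same.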

With these definitions of addition and multiplication, the
system of residue classes has the structure of a ring and a
number system. This follows from the fact that addition
and multiplication in the system are commutative and associative,
and multiplication is distributive with respect to
addition. We can verify the commutativity of addition very
easily. $C_P + C_Q = C_{P+Q}$, and $C_Q + C_P = C_{Q+P}$. But addition
of polynomials is commutative, so P + Q = Q + P,
and therefore $C_{P+Q} = C_{Q+P}$. The other properties are
established by a similar argument. The zero element in the
ring is $C_0$, because if $C_P$ is the class of any polynomial P,
$C_0 + C_P = C_{0+P} = C_P$.

Within this number system, there is a subset that is
isomorphic to the real number system. This subset consists
of all classes of the form $C_a$, where a is a real number
(that is, a polynomial of zero degree). To prove that it is
isomorphic to the real number system, we have to produce
a one-to-one correspondence that preserves the operations
of addition and multiplication. The mapping we use for this
purpose is $a \leftrightarrow C_a$. The mapping is a one-to-one correspondence,
because no two different real numbers belong to
the same class. Under this mapping, if a and b are two real
numbers, their images are $C_a$ and $C_b$. The sum of the images
is $C_a + C_b = C_{a+b}$, which is the image of the sum. The
product of the images is $C_a \cdot C_b = C_{ab}$, which is the image
of the product. Therefore the set of classes of the form $C_a$,
where a is a real number, is isomorphic to the real number
system. Since isomorphic systems are the same for all practical
purposes, we can use the symbols of one system to
represent the other. So now, instead of writing $C_a$ for the
class to which a real number a belongs, we shall simply
write a. Thus 0 will stand for $C_0$, 1 will stand for $C_1$, and
-1 will stand for $C_{-1}$.

Now that we have found the real numbers within our new
system, we know that it is an extension of the real number
system. Next we observe that it contains an element that
is a solution to the equation $x^2 + 1 = 0$. To show that this
is so, we must produce a class $C_P$ which has the property
that $C_P \cdot C_P + 1 = 0$. The class that has this property
is the class $C_x$. In fact,
$C_x \cdot C_x + 1 = C_x \cdot C_x + C_1 = C_{x^2} + C_1 = C_{x^2+1} = C_0 = 0$.
Notice that we made use of the
fact that when $x^2 + 1$ is divided by $x^2 + 1$, the remainder
is 0, so that $C_{x^2+1} = C_0$.

The class $C_x$ therefore has the same property as the complex
number we called i before. In fact, since $x^2$ has the
remainder -1 when we divide it by $x^2 + 1$,
$C_x \cdot C_x = C_{x^2} = C_{-1} = -1$. Let us therefore use the symbol i to stand for $C_x$,
since the rule $i^2 = -1$ holds for $C_x$ as well.

Now only one more step remains to show that this system
of residue classes is essentially the same as the complex
number system we built up before. We show that it is isomorphic
to the complex number system. First let us remind
ourselves that every class in the system can be represented
by the remainder that the members of the class have in
common. Secondly, we recall that all remainders, when we
divide by $x^2 + 1$, are polynomials of the first degree or lower, of the form
a + bx where a and b are real numbers. So every member
of our new class of numbers can be represented in the form
$C_{a+bx}$. Now, using the rules for addition and multiplication
of residue classes, we see that $C_{a+bx}$ can be written as
$C_a + C_b \cdot C_x$. We have already agreed to write a instead of
$C_a$, and b instead of $C_b$, because classes of this special form
behave like real numbers. We have also agreed to write i
for $C_x$, because the rule that $i^2 = -1$ correctly describes the
behavior of this class. So we can write $C_a + C_b \cdot C_x$ in the
form a + bi, with the understanding that a and b behave
like real numbers and i has the property $i^2 = -1$. When we
write it this way we see that the residue class $C_{a+bx}$ is
nothing but the complex number a + bi in disguise. The
system of residue classes modulo ($x^2 + 1$) is therefore the
field of complex numbers. This completes the third construction
of the complex number system.

We have now seen the system of complex numbers in
three forms of dress. We have seen it as a system of ordered
pairs, as a system of two by two matrices, and as a system 
of residue classes in a ring of polynomials. In each of these 
systems, an individual complex number has a different ap- 
pearance. But the structure of the three systems, in so far 
as their addition and multiplication tables are concerned, is 
the same. So we recognize them as merely different repre- 
sentations of one and the same number system, whose ele- 
ments are usually represented in the convenient form a + bi. 

No More Extensions Needed 

We have expanded our number system five times. Four of
these expansions to a more extensive system were made
necessary by the fact that a certain type of equation could
not be solved in the less extensive system. In the system of 
whole numbers, we could not solve an equation like x + 5 = 
3. So we constructed the system of integers, where all equa- 
tions of the form x + b = a have a solution. In the system of 
integers, we could not solve an equation like 2x = 3. So we 
constructed the system of rational numbers, in which all 
equations of the form ax = b have a solution, as long as a 
is not equal to 0. At this stage, we found that we could al- 
ways solve equations of the first degree, whose general form 
is ax + b = 0. But we could not say the same for equations 
of the second degree. To be able to solve the equation $x^2 - 2 = 0$,
we had to construct the real number system. To be
able to solve the equation $x^2 + 1 = 0$, we had to construct
the complex number system.
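
The four equations named in this summary can be matched with number types that a programming language provides. The Python sketch below is only a loose illustration: its int, Fraction, float, and complex types are stand-ins for the integers, rational numbers, real numbers, and complex numbers, and a float holds the square root of 2 only approximately. The variable names are chosen for this example.

```python
import cmath
import math
from fractions import Fraction

# x + 5 = 3 has no whole-number solution, but the integer -2 solves it.
x_int = 3 - 5

# 2x = 3 has no integer solution, but the rational number 3/2 solves it.
x_rat = Fraction(3, 2)

# x^2 - 2 = 0 has no rational solution; the real number sqrt(2) solves it
# (stored here only to floating-point accuracy).
x_real = math.sqrt(2)

# x^2 + 1 = 0 has no real solution; the complex number i solves it.
x_cplx = cmath.sqrt(-1)

print(x_int + 5, 2 * x_rat, x_real ** 2 - 2, x_cplx * x_cplx + 1)
```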

Now it is worth asking what equations we can solve in 
the complex number system. So far we have examined only 
equations of the first and second degree. If we try other 
algebraic equations of higher degree, will we find any that 
cannot be solved in the complex number system? Will it 
be necessary to expand the number system again and again, 
as we try more and more complicated algebraic equations? 
If you have had visions of an endless chain of extensions of
the number system, you can dismiss them at once. As far
as algebraic equations are concerned, we have reached the
end of the road. The complex number system not only gives
us a solution to the equation $x^2 + 1 = 0$. It gives us a
solution for every other algebraic equation as well. To be
more specific, it is known that every algebraic equation of
the form

$$a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 = 0,$$

whose coefficients are complex numbers (with n at least 1
and $a_n$ not equal to 0), has a solution in
the complex number system. In fact it has as many solutions
as the degree of the equation indicates, provided that
a repeated solution is counted as many times as it occurs. A first degree
equation has one solution, a second degree equation has
two solutions, a third degree equation has three solutions,
and so on. The fact that algebraic equations can always be
solved within the complex number system is known as the
Fundamental Theorem of Algebra. It was first proved by
the great German mathematician, Carl Friedrich Gauss
(1777-1855).
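
For equations of the second degree the two solutions can be written down with the familiar quadratic formula, which keeps working when the coefficients are complex. The Python sketch below uses the standard cmath module; the function name solve_quadratic is hypothetical, and the results are floating-point approximations. It illustrates the theorem for degree two only and is not a proof of it.

```python
import cmath


def solve_quadratic(a, b, c):
    """Both solutions of a*x^2 + b*x + c = 0 for complex a, b, c with a != 0."""
    root = cmath.sqrt(b * b - 4 * a * c)   # a complex square root always exists
    return ((-b + root) / (2 * a), (-b - root) / (2 * a))


x1, x2 = solve_quadratic(1, 0, 1)      # x^2 + 1 = 0
y1, y2 = solve_quadratic(1, -2, 5)     # x^2 - 2x + 5 = 0
z1, z2 = solve_quadratic(1, 0, -1j)    # x^2 - i = 0, a complex coefficient

for x in (x1, x2):
    print(x, abs(x * x + 1))           # the second value should be close to 0
```

Substituting each solution back into its equation gives a result that is zero up to rounding error.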

The Old in the New 

We have accomplished the purpose we aimed for at the 
beginning of the book. We have seen how, through succes- 
sive extensions of the number system, mathematicians 
have eliminated its defects while losing none of its virtues. 
We have been introduced to a variety of mathematical 
structures, like groups, rings, fields, vector spaces, and topo- 
logical spaces, that are being explored vigorously in mathe- 
matics today. We have found that, although their names 
are new, and at first sound unfamiliar, they are closely re- 
lated to such familiar things as addition and multiplication 
of numbers, and collections of points on a line. Although 
the world of modern mathematics is a new world in many 
ways, it has never lost contact with the old world of num- 
ber and space from which it has grown. 

DO IT YOURSELF 

1. Use the definition of multiplication of ordered pairs (see
page 161) to find the following products:

(2, 3) · (4, 1)    (2, 3) · (2, -3)    (2, 0) · (0, 3)

2. Write each of these ordered pairs in the form a + bi,
making use of the convention that 1 stands for (1, 0),
and i stands for (0, 1):

(2, 3)    (2, -3)    (3, 0)    (0, 2)

3. Verify by multiplication that (1 - i)/2 is the reciprocal of
1 + i. (Remember that $i^2 = -1$.)

4. Let 1 stand for the two by two matrix
$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$, and let
i stand for the two by two matrix
$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$.
Write the two by two matrices that represent 2 + 3i and
2 - 3i. Verify by matrix multiplication that

$$(2 + 3i) \cdot (2 - 3i) = 13 = \begin{pmatrix} 13 & 0 \\ 0 & 13 \end{pmatrix}.$$

5. Use the rules for addition and multiplication of residue
classes modulo ($x^2 + 1$) to verify the following sums or
products:

$C_{2x^2+1} + C_1 = C_0$.    $C_x \cdot C_{2x} = C_{-2}$.
$C_{x+1} \cdot C_{x+2} = C_{3x+1}$.    $C_{x^2} + C_2 = C_1$.



Summary of Basic Definitions


Zero element: A is a zero element for addition if A + X =
X + A = X, for every X.

Unity element: U is a unity element for multiplication if
U · X = X · U = X, for every X.

Identity element: E is an identity element for the operation
* if E * X = X * E = X, for every X.

Negative: A is the negative of B if A + B = B + A = 0.

Reciprocal: A is the reciprocal of B if A · B = B · A = 1.

Inverse: A is the inverse of B with respect to the operation
* whose identity element is E, if A * B = B * A = E.

Commutative: Addition is commutative if x + y = y + x.
Multiplication is commutative if x · y = y · x.

Associative: Addition is associative if (x + y) + z = x +
(y + z). Multiplication is associative if (x · y) · z =
x · (y · z).

Distributive: Multiplication is distributive with respect to
addition if x · (y + z) = x · y + x · z.

Number system: A set of elements is a number system if 
it has two binary operations called addition and 
multiplication, each of which is commutative and 
associative, and if multiplication is distributive with 
respect to addition. 

Group: A system is called a group if it has a binary operation
that is associative, has an identity element for
the operation, and has an inverse for every element.
If the operation is commutative, the group is called
abelian.

Ring: A system is called a ring if it has two associative
binary operations called addition and multiplication,
is an abelian group with respect to addition, and if
the multiplication is distributive with respect to addition.

Field: A ring is called a field if it has a unity element for 
multiplication and contains a reciprocal for every 
element except 0. 

Topological Space: A system of elements is called a topo- 
logical space if a collection of its subsets is singled 
out to be called “open sets,” and the collection has 
these properties: 1) The whole space and the empty 
set belong to the collection. 2) The union of any 
number of sets in the collection is also in the collec- 
tion. 3) The intersection of any two sets in the col- 
lection is also in the collection. 

Vector Space: A system is a vector space if it is an abelian
group with respect to addition, is subject to a scalar
multiplication by elements from an associated field
of scalars, and if the scalar multiplication obeys
these laws:

$$r \cdot (\vec{x} + \vec{y}) = r \cdot \vec{x} + r \cdot \vec{y}$$

$$(r + s) \cdot \vec{x} = r \cdot \vec{x} + s \cdot \vec{x}$$

$$r \cdot (s \cdot \vec{x}) = (r \cdot s) \cdot \vec{x}$$

$$1 \cdot \vec{x} = \vec{x}$$

Algebra: A system is an algebra if it is provided with binary 
operations of addition and multiplication, and a 
scalar multiplication, that make it both a vector 
space and a ring. 

Geometry: A geometry is a study which identifies those 
figures in a space which are equivalent to each other 
under a group of transformations, and determines 
what properties equivalent figures have in common. 
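
For a small finite system, several of these definitions can be checked by brute force. The Python sketch below tests the residue classes modulo n, named by their remainders 0, 1, ..., n - 1. The function name check_residues is hypothetical, and the sketch assumes that 0 and 1 play the parts of the zero and unity elements, which holds for n of 2 or more. It reports whether the system is a number system and whether it is also a field in the sense defined above.

```python
from itertools import product


def check_residues(n):
    """Brute-force check of the definitions for the residue classes modulo n.

    Returns the pair (is_number_system, is_field).
    """
    elems = range(n)

    def add(x, y):
        return (x + y) % n

    def mul(x, y):
        return (x * y) % n

    def commutative(op):
        return all(op(x, y) == op(y, x) for x, y in product(elems, repeat=2))

    def associative(op):
        return all(op(op(x, y), z) == op(x, op(y, z))
                   for x, y, z in product(elems, repeat=3))

    distributive = all(mul(x, add(y, z)) == add(mul(x, y), mul(x, z))
                       for x, y, z in product(elems, repeat=3))

    is_number_system = (commutative(add) and commutative(mul)
                        and associative(add) and associative(mul)
                        and distributive)

    # Ring: every element has a negative (0 is the zero element).
    has_negatives = all(any(add(x, y) == 0 for y in elems) for x in elems)
    # Field: every element except 0 has a reciprocal (1 is the unity element).
    has_reciprocals = all(any(mul(x, y) == 1 for y in elems)
                          for x in elems if x != 0)

    is_field = is_number_system and has_negatives and has_reciprocals
    return is_number_system, is_field


print(check_residues(3))
print(check_residues(4))
```

Modulo 3 both tests succeed. Modulo 4 the system is still a number system, but the class of 2 has no reciprocal, so it is not a field.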
Appendix I


Do It Yourself Supplement 


CHAPTER I 


Ex. 1-3 are on pp. 34 -34a of the text. 


4. In Ex. 2, p. 34, an addition operation is defined for the set whose 
elements are a and b. If this operation obeys an associative law, 
then x+(y + z) = (x+y) + z for all values of x, y and z. 

a) The associative law stated above is an abbreviated way of mak- 
ing eight separate statements, obtained by replacing x, y and z by 
a or b. Write these eight statements. 

b) Prove the associative law for this system by verifying from the 
table that all eight statements are true. 

5. A multiplication operation is defined for the system of Ex. 2 by
this table:

     ·  |  a   b
    ----+--------
     a  |  a   a
     b  |  a   b

a) Does this system have a unity element? 

b) Show that multiplication is commutative in this system. 

c) Show that multiplication is associative in this system. (Write 
the eight possible statements of the form x • (y • z) = (x • y) • z, 
and verify them from the table.) 

d) Show that this multiplication is distributive with respect to the 
addition defined in Ex. 2. (Write the eight possible statements of 
the form x · (y + z) = x · y + x · z, and verify them from the tables.)

e) Using the same tables, show that addition is not distributive
with respect to multiplication. (Write the eight possible statements
of the form x + (y · z) = (x + y) · (x + z), and show that they
are not all true.)

f) What conclusion follows from the results of 2b, 4a, b, and 5b, 
c, d? 

6. Define addition and multiplication for the system of two elements
0 and 1 as follows:

     +  |  0   1          ·  |  0   1
    ----+--------        ----+--------
     0  |  0   1          0  |  0   0
     1  |  1   0          1  |  0   1

a) Show that 0 is a zero element and that 1 is a unity element for
the system.

b) Show that, with the operations defined by these tables, the sys- 
tem is a number system. 

c) Show that this system is isomorphic to the system defined in 
Ex. 2 and 5. 

7. Define addition and multiplication for the system of three elements 
a, b and c as follows: 


a) Show that a is a zero element and b is a unity element. 

b) Show that the tables define a number system. 

8. If a and b are both zero elements in a number system, prove that 
a = b. 

9. If a and b are both unity elements in a number system, prove that 
a = b. 

10. Show that (20 av 24) av 36 ≠ 20 av (24 av 36). (See p. 21.)

11. Define a binary operation D for the system of natural numbers (in- 
cluding 0) as follows: 

If a = b, aDb = 0. If a ≠ b, aDb = the larger of the numbers minus
the smaller one. (For example, 5D5 = 0; 6D2 = 4; 3D4 = 1.)

a) Prove that the operation D is commutative. 

b) Prove that the operation D is not associative. 

12. Define a binary operation S for the system of natural numbers as 
follows: aSb = 0, if a is less than b; aSb = 1, if a is greater than b; 
and aSa = a. 

a) Construct the table for the operation S. 

b) Prove that the operation S is neither commutative nor associative.

13. Determine whether the binary operation * defined for the system of
natural numbers is commutative or associative if

a) x * y = x + 3y.

b) x * y = x + y + 5.

14. a) Use ordinary addition and multiplication as operations on the
set {0, 3, 6, 9, ...} containing all the natural numbers that are
multiples of 3. Show that, with these operations, the system of
multiples of 3 is a number system.

b) k ↔ 3k defines a one-to-one correspondence between the
system of natural numbers and the system of multiples of 3. Show
that this correspondence is not an isomorphism.


CHAPTER II 


Ex. 1-3 are on p. 43 of the text. 

4. Assign names to the subsets found in Ex. 3. 

a) Construct a table for the union operation for this system of 
subsets. 

b) Construct a table for the intersection operation. 

5. Assign names to the subsets of {x} as follows: 1 = {x}, 0 = { }.
Construct a table for the union operation and intersection operation
for this system of subsets.

6. Let I = {a, b, c, d, e, f, g, h}. If X is a subset of I, let X' represent
the complement of X in I.

Let R = {a, b, e, f}    S = {a, b, g, h}    T = {a, b, c, d}

U = {g, h}    V = {a, b, c}    W = {a, e, h}

Z = {b, d, g}

List the elements in each of these sets:

a) R', S', T', U', V', W', Z'.

b) R ∪ S, R ∪ T, R ∪ U, R ∪ V, R ∪ W.

c) R ∩ S, R ∩ T, R ∩ U, R ∩ V, R ∩ W.

d) R ∪ Z, S ∪ T, S ∪ U, S ∪ V, S ∪ W.

e) R ∩ Z, S ∩ T, S ∩ U, S ∩ V, S ∩ W.

f) S ∪ Z, T ∪ U, T ∪ V, T ∪ W, T ∪ Z.

g) S ∩ Z, T ∩ U, T ∩ V, T ∩ W, T ∩ Z.

h) U ∪ V, U ∪ W, U ∪ Z, V ∪ W, V ∪ Z, W ∪ Z.

i) U ∩ V, U ∩ W, U ∩ Z, V ∩ W, V ∩ Z, W ∩ Z.

j) R ∪ S', T ∪ W', U ∪ V', U ∪ Z', W ∪ Z'.

k) R' ∪ S, T' ∪ W, U' ∪ V, U' ∪ Z, W' ∪ Z.

l) R ∩ S', T ∩ W', U ∩ V', U ∩ Z', W ∩ Z'.

m) R' ∩ S, T' ∩ W, U' ∩ V, U' ∩ Z, W' ∩ Z.


CHAPTER III


Ex. 1-6 are on pp. 71-2 of the text. 

7. Add (6 ~ 2) + (2 ~ 5); (10 ~ 1) + (1 ~ 10); (7 ~ 4) + (12 ~ 3).

8. Multiply (6 ~ 2) · (2 ~ 5); (10 ~ 1) · (1 ~ 10); (7 ~ 4) · (12 ~ 3).

9. Use the test for equality of integers to prove that
(5 ~ 2) = (7 ~ 4). (See p. 48.)

10. a) Add (5 ~ 2) + (x ~ y).

b) Add (7 ~ 4) + (x ~ y).

c) Use the test for equality of integers to show that the answers
to a) and b) are equal.

11. a) Multiply (5 ~ 2) · (x ~ y).

b) Multiply (7 ~ 4) · (x ~ y).

c) Use the test for equality of integers to show that the answers
to a) and b) are equal.

12. Prove (a ~ b) + [(c ~ d) + (e ~ f)] = [(a ~ b) + (c ~ d)] + (e ~ f).

13. Prove (a ~ b) · [(c ~ d) · (e ~ f)] = [(a ~ b) · (c ~ d)] · (e ~ f).

14. Prove (a ~ b) · [(c ~ d) + (e ~ f)] = (a ~ b) · (c ~ d) + (a ~ b) · (e ~ f).

15. Prove (a + d ~ b + d) = (a ~ b).

16. If A is an integer, prove (-1) · A = -A. (Hint: Let A = (x ~ y).
Then -A = (y ~ x). -1 = (0 ~ 1).)

17. Prove that in any ring with a unity element (-1) · A = -A.
(Hint: (-1) + 1 = 0. Multiply both sides by A on the right.)

18. Prove that if A and B are elements of a ring,

a) (-A) · (B) = -(A · B). b) (A) · (-B) = -(A · B).

c) (-A) · (-B) = A · B. d) -(A + B) = (-A) + (-B).

(Hint: Use the fact that A + (-A) = 0, and B + (-B) = 0.)

19. Define eight permutations on the letters A, B, C, D as follows:

     I      P      Q      R      S      T      U      V

    A→A    A→B    A→C    A→D    A→A    A→C    A→B    A→D
    B→B    B→C    B→D    B→A    B→D    B→B    B→A    B→C
    C→C    C→D    C→A    C→B    C→C    C→A    C→D    C→B
    D→D    D→A    D→B    D→C    D→B    D→D    D→C    D→A

a) If consecutive vertices of a square are called A, B, C, D respectively,
show that I, P, Q, R, S, T, U, V are the symmetries of the
square.

b) Define * as in Ex. 4, and construct the multiplication table for
this set of permutations.

c) Prove that they form a group with the operation *.

d) Prove that I, P, Q, R form a subgroup.

e) Prove that I, S form a subgroup.

f) Find four more subgroups.

20. Use the associated remainders, 0, 1, 2, 3, as the names of the
residue classes modulo 4. Construct addition and multiplication
tables for these residue classes. What element is a zero divisor
in this system?

21. Show that the set of all multiples of 4 is a subgroup of the group 
of integers with respect to the operation +. Show that it is a sub- 
ring of the ring of integers. Show that it is an ideal of the ring of 
integers. 

22. Every integer can be written in the form 3n + r, where r = 0, 1, or 2, 
and n is an integer. 3n + r → r defines a mapping of the ring of integers 
onto the residue class ring modulo 3. Show that this mapping 
is a ring homomorphism. 

23. Show that the subgroup I, P, Q, R in Ex. 19 is isomorphic to the 
residue class group modulo 4 with + as group operation. (Ex. 20.) 

24. Let K = the subgroup {I, P, Q, R} in Ex. 19. 

Denote the set {S*I, S*P, S*Q, S*R} by S*K. 

Denote the set {I*S, P*S, Q*S, R*S} by K*S. 

S*K is called a left coset of K, and K*S is called a right coset of K. 

a) Show that in this case, S*K = K*S. 

b) Show that S*K = T*K = U*K = V*K. 

c) Denoting the coset obtained in b) by L, define the operation * 
for the set {K, L} as on p. 67, and construct the multiplication 
table for the set {K, L} with this operation. Show that the set 
{K, L} with the operation * is a group (a quotient group). Show that 
it is isomorphic to the subgroup {I, S}. 


CHAPTER IV 

Ex. 1-2 are on p. 89 of the text. 

3. Prove that multiplication of rational numbers is commutative. 

4. Prove that addition of rational numbers is associative. 

5. Prove that multiplication of rational numbers is associative. 

6. Prove that multiplication of rational numbers is distributive with 
respect to addition. 

7 . a)Add (f) + (^) 

b) Add(^) +(y) 

c) Use the test for equality of rational numbers to show that the 
answers to a) and b) are equal. 

V b ■ C ) V y ) 

c) Use the test for equality of rational numbers to show that the 
answers to a) and b) are equal. 

9. a) Multiply (4) • 

b) Multiply (4) ■ (y) 

c) Use the test for equality of rational numbers to show that the 
answers to a) and b) are equal. 

10. a) Multiply ) 

b) Multiply (-J-4). (y) 

c) Use the test for equality of rational numbers to show that the 
answers to a) and b) are equal. 

11. Prove that (-1) + ( Ly = 

12. If c ^ 0, prove that (~^~)~ (-£ — — ) 

13. Define “greater than” (denoted by >) for the rational number system 
as follows: a > b if a - b is positive. (See p. 54.) If a and b 
are rational numbers, and a > b, let m = ½(a + b). 

Prove that a > m, and m > b. 


CHAPTER V 


Ex. 1-4 are on p. 124 of the text. 


5. Every even integer has the form 2k, and every odd integer has the 
form 2k + 1, where k is an integer. Use this notation to prove that 
the square of every even integer is an even integer, and the square 
of every odd integer is an odd integer. 

6. Find the repeating decimal that represents 


4 4 4 




7. Find the rational number represented by the repeating decimal: 
a) .555... b) .777... c) .060606... 

d) .050505... e) .121212... f) .161616... 

g) .123123123... h) .025025025... i) 6.2575757... 

j) 3.2777... 


8. The method used on page 107 for expressing a repeating decimal 
as a fraction can be used for finding the sum of some convergent 
series. For example, if x = 1 + ½ + (½)² + (½)³ + ..., 
then x = 1 + ½[1 + ½ + (½)² + ...] and x = 1 + ½x. Solving this 
equation for x, we find that x = 2. Use this method to find the 
sum of each of these series: 



9. Which of these sets of real numbers are open? closed? neither 
open nor closed? 

a) {1, 2, 3, 4, 5} 

b) The set of all integers. 

c) The set of all numbers x such that 2 < x < 3. 

d) The set of all numbers x such that 2 ≤ x < 3. 

e) The set of all numbers x such that 2 ≤ x ≤ 3. 

f) The set of numbers of the form 5-1 where n is a positive 

integer. 


CHAPTER VI 

Ex. 1-6 are on pp. 146-7 of the text. 


7. Express as an ordered pair: 

a) (2, 4) + (x, 6). b) (3, -1) + (7, 2). 

c) (6, -4) + (-2, 3) + (1, 1). d) 4(2, -1) + 3(4, -2). 
e) 2(-3, 1) + 4(5, -2). f) t(2, 1) + (5, 7). 

8. Solve the following equations for the vector (x, y): 

a) (x, y) + (2, -3) = (0, 0). 

b) (x, y) + (4, 1) = (7, 2). 

c) 2(x, y) + (3, -2) = (4, 1). 

9. Solve for the scalars x and y: 

a) x(2, 3) + y(4, -1) = (16, 3). 

b) x(2, 4) + y(9, 6) = (4, 4). 

c) x(1, 0) + y(0, 1) = (0, 0). 




10. Find the determinant of each of these linear transformations: 

A:  x' = 3x - 2y          B:  x' = 2x + 3y 
    y' = 4x - 3y              y' = 4x + 6y 

C:  x' = x + y            D:  x' = 2x + 3y 
    y' = 2x - y               y' = 3x + 5y 

11. a) Which of the transformations in Ex. 10 have an inverse? 
b) Find these inverses. (See p. 143.) 

12. Using A, B, C, D, as defined in Ex. 10, 

a) find the products A*B and B*A. 

b) find the products A*C and C*A. 

c) find the products A*D and D*A. 

d) find the products B*C and C*B. 

e) find the products B*D and D*B. 

f) find the products C*D and D*C. 


CHAPTER VII 


Ex. 1-4 are on p. 159 of the text. 


5. Prove that 

    r | a  b |  =  | r  0 | | a  b | 
      | c  d |     | 0  r | | c  d | 


6. Prove that 

      | a  b  c |     | r  0  0 | | a  b  c | 
    r | d  e  f |  =  | 0  r  0 | | d  e  f | 
      | g  h  i |     | 0  0  r | | g  h  i | 


7. The transformation B is defined by:  x' = 2x + y 
                                        y' = 3x + 2y. 


a) Find B⁻¹. 

b) Write the matrices of B and B⁻¹. 

c) If we denote these matrices by B and B⁻¹ respectively, show 
that B B⁻¹ = B⁻¹ B = I. 


8. If A = | r  0 |  and B = | x  0 |, 
          | 0  s |          | 0  y | 

prove that AB = BA. 


9. If A = | r  0 |  and B = | x  y |, 
          | 0  r |          | z  w | 

prove that AB = BA. 


10. If A = | r  0 |  and B = | x  y |, 
           | 0  s |          | z  w | 

find AB and BA. 


11. If A = | a  0  0 |  and B = | x  0  0 |, 
           | 0  b  0 |          | 0  y  0 | 
           | 0  0  c |          | 0  0  z | 

prove that AB = BA. 


12. If A = | c  0  0 |  and B = | x  y  z |, 
           | 0  c  0 |          | u  v  w | 
           | 0  0  c |          | r  s  t | 

prove that AB = BA. 


13. If A = | a  0  0 |  and B = | x  y  z |, 
           | 0  b  0 |          | u  v  w | 
           | 0  0  c |          | r  s  t | 

find AB and BA. 


CHAPTER VIII 


Ex. 1-5 are on pp. 178-9 of the text. 


6. Multiply out, and express the answer in the form a + bi: 

a) i³ b) (2 + i)⁵ 

c) i* d) (3 + 4i)(5 - 2i) 

e) (1 + i)² f) i(2 + 3i) 

g) (2 - 3i)(2 + 3i) h) (c + di)(e + fi) 

7. Prove that (a + bi)(a — bi) is a real number. 

8. Find the reciprocal of 

a) i b) 6 + 8i 

c) i l d) 1 + i 

e) 5 + 12i 


9. If 1 = | 1  0 |  and i = |  0  1 |, 
          | 0  1 |          | -1  0 | 

write each of these complex numbers as a two by two matrix: 
a) 1 + i b) 2 + 2i c) 2 + i 

d) 2 - i e) 3 + i f) 2 - 2i 

g) -1 + i h) 3 - i i) -1 - i 

10. Use the results of Ex. 9 to find these products by matrix multiplication; 
then check by the method of multiplication shown on p. 167: 

a) (1 + i)(3 + i)          b) (2 - 2i)(3 + i) 
c) (2 - i)(2 + i)          d) (2 - 2i)(1 + i) 
e) (-1 + i)(-1 - i)        f) (2 - i)(1 + i) 
g) (3 + i)(3 - i)          h) (2 - 2i)(-1 - i) 




11. In any field, we can define division as follows: 

A ÷ B = A • (1/B). With the help of this definition, express in the 
form a + bi: 

a) (1 + i)/(3 + 4i)   b) (2 - i)/(5 + 12i)   c) (3 + i)/(1 + i)   d) i/(1 - i) 

12. If x = a + bi, a - bi is called the conjugate of x, and is denoted by 
$\bar{x}$. If x and y are complex numbers, prove: 

a) $\overline{\bar{x}} = x$. 

b) $\overline{x + y} = \bar{x} + \bar{y}$. 

c) $\bar{x} \cdot \bar{y} = \overline{x \cdot y}$. 

d) $\bar{x} - \bar{y} = \overline{x - y}$. 

-r®. 

f) x + x is a real number. 



is a real number. 


Appendix II 


The Changes Taking 
Place in Mathematics * 


There is now a vigorous movement under way 
to reexamine and modify the teaching of mathematics. The 
avowed purpose of this movement is to bring the teaching of 
mathematics up to date by taking into account the changes 
that have taken place in mathematics. It is necessary to begin, 
then, by making a survey of the nature and significance of 
these changes. We make such a survey in this chapter. This 
is not the first such survey, nor will it be the last. The im- 
provement of instruction in mathematics is a continuing 
process that never ends. Those who participate in this process 
must keep turning their heads to look repeatedly, now at 
mathematics, now at the schools. I hope that by taking a 
new look at the new mathematics in this chapter, we may 
find some fresh insights into what must be done to make the 
teaching of mathematics more effective. 

What are the distinguishing characteristics of contempo- 
rary mathematics? It seems to me that there are four of them, 
which I would describe as follows: 

* This is the text of the keynote address delivered by Dr. Adler at the 1961 
Conference in Washington, D. C., of state supervisors of mathematics, called 
by the United States Office of Education of the Department of Health, 
Education, and Welfare. 

1. Contemporary mathematics is classical mathematics 
grown mature. 

2. Contemporary mathematics is classical mathematics 
grown self-conscious and self-critical. 

3. It is also modern mathematics, which developed as a 
more efficient way of dealing with the content of classical 
mathematics. 

4. Finally, it is mathematics that is more and more inti- 
mately related to man’s activities in industry, social life, 
science and philosophy. 

We shall examine each of these four characteristics in turn. 


Classical Mathematics Grown Mature 

Classical mathematics may be described as the study of 
number and space. The study of number became arithmetic, 
algebra and analysis. The study of space became geometry. 
We shall note and comment on six aspects of the growth to 
maturity of classical mathematics. 

Change from number system to number systems 

We no longer have just one number system. We have many 
number systems. The pluralization of the concept of number 
system developed in two ways. First, there was a steady ex- 
pansion of the original number system of everyday use. At 
first the only numbers known were the natural numbers, used 
for counting. The requirements of measurement led to the 
introduction of rational numbers. Geometric theory led to 
the introduction of irrational numbers. Algebraic theory led 
to the introduction of negative numbers and complex numbers. 
If we recognize as a number system any set of numbers 
that is closed under addition and multiplication, subject to 
the usual associative, commutative and distributive laws, then 
we see that the expansion of the number system has given us 
five number systems, one within the other. We have the 
system of natural numbers within the system of integers 
within the system of rational numbers within the system of 
real numbers within the system of complex numbers.* 

Secondly, more number systems were introduced in the 
course of the development of arithmetic and algebra. In the 
first place there was a modification and elaboration of the con- 
cept of number system by the introduction of the concepts of 
group, ring, field, and vector space. There was also the dis- 
covery of many new structures which were like number systems 
in the sense that they too were groups, or rings, or fields or 
vector spaces. For example, the set of all permutations of n 
objects is a group, where permutations are “multiplied” by 
performing them one after the other. The set of numbers 
a + b√2, where a and b are rational, is a field. The set of 
residue classes of integers modulo an integer m is a ring. It is 
a field if m is a prime number. The set of all polynomials with 
real coefficients is a ring. If p is a prime, and q is a power of p, 
then the set of all roots modulo p of the equation x^q - x = 0 is 
a field. The set of n by n matrices whose terms are complex 
numbers is a vector space over the complex field, etc. Addi- 
tional systems that were something like number systems were 
also discovered in other areas of study. In the study of logic an 
algebra of propositions was developed in which disjunction 
played the role of addition and conjunction played the role of 
multiplication. In the study of the theory of sets, an algebra of 

* See Irving Adler, The New Mathematics (New York, The John Day 
Company). 

subsets of a set was developed in which union played the role 
of addition and intersection played the role of multiplication. 
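
The remark about residue classes can be checked by direct computation for small moduli. The following is only a sketch in Python; the function names `zero_divisors` and `is_field` are our own labels for illustration and do not come from the text. It searches the residue classes modulo m for zero divisors and for multiplicative inverses.

```python
def zero_divisors(m):
    """Nonzero residues a (mod m) with a*b = 0 (mod m) for some nonzero b."""
    return [a for a in range(1, m)
            if any((a * b) % m == 0 for b in range(1, m))]


def is_field(m):
    """True when every nonzero residue mod m has a multiplicative inverse."""
    return m > 1 and all(
        any((a * b) % m == 1 for b in range(1, m))
        for a in range(1, m)
    )
```

For the modulus 4 the search finds the single zero divisor 2, so the residue classes modulo 4 form a ring but not a field, while for the prime modulus 5 no zero divisor turns up and every nonzero class has an inverse.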

An incidental but important result of the refinement and 
pluralization of the concept of number system was that we 
finally learned how to ask a question sensibly in algebra. We 
used to look for roots of an algebraic equation without know- 
ing exactly what we were looking for. As a result we weren’t 
quite sure about whether we could believe in the results we 
obtained. For example, some mathematicians, like Descartes, 
used to reject complex roots as meaningless. Now we are very 
specific in formulating our questions in an unambiguous man- 
ner. We specify a domain in which a problem is defined and 
in which an answer is sought. For example, we may say, 
“Given an algebraic equation with coefficients in the real field, 
does it have roots in the real field? Does it have roots in some 
extension field of the real field?” 

Change from geometry to geometries 

We no longer have just one geometry. We have many ge- 
ometries. The pluralization of the concept of geometry devel- 
oped in two separate ways. We found that there are other 
spaces besides the traditional three-dimensional Euclidean 
space. We also found that there are many geometries within 
any given space. 

The attempts to prove the parallel postulate led to the rec- 
ognition that there is a space (Lobatschewskian) that satisfies 
all the axioms of Euclid except the parallel postulate. Thus we 
got two geometric spaces, Euclidean and non-Euclidean, in- 
stead of one. Then Riemann gave us an infinite number of 
spaces by showing that a space can be constructed by using any 
one of an infinite number of quadratic forms to define a metric 
on a manifold. More spaces were obtained by recognizing the 
validity of spaces of n dimensions, where n is any positive 
integer.* 

In the study of Euclidean space of three dimensions, many 
separate geometries emerged. One is the geometry of congru- 
ence and a second is the geometry of similarity. In addition 
there are the geometry of incidence relations (projective ge- 
ometry), the geometry based on concepts of nearness and con- 
nectedness (topology), and many others such as inversion 
geometry and affine geometry. 

Unification of the concepts of number and space 

The separate disciplines of algebra and geometry converged 
toward each other and joined in Cartesian geometry. This re- 
sulted in a fusion of the notions of number and space. The real 
number system, for example, is nothing but a Euclidean line, 
and the complex number system is nothing but a Euclidean 
plane. Algebraists, of course, would prefer to say it the other 
way around. 

Development of analysis 

Various concepts and techniques that are based on the idea 
of limit were introduced into mathematics. These include such 
notions as continuity, the sum of an infinite series, derivative 
(which is a limit of a quotient), and definite integral (which 
is a limit of a sum). The development of analysis led to a new 
dichotomy in the subject matter of mathematics to replace the 
old one of number versus space. We recognized that the num- 
ber system (real or complex) has an algebraic aspect, found 
in the properties of the operations addition and multiplication, 

* See Irving Adler, A New Look at Geometry (New York, The John Day 
Company). 

and a topological aspect based on the concept of nearness and 
which underlies all considerations of convergence. These two 
aspects of the number system can be studied separately or in 
combination. 

Evolution of the concept of function 

There has been a gradual development of a more and more 
general concept of function. The first functions employed 
were algebraic functions. The study of trigonometry led to the 
introduction of the circular functions. Calculus contributed the 
logarithmic function (the integral of dx/x), and its inverse, 
the exponential function. The pendulum problem contributed 
elliptic integrals and functions. The theory of heat contributed 
functions defined by Fourier series. Experience with many 
functions ultimately led to recognition of the fact that a func- 
tion need not have a derivative, and does not even have to be 
continuous. 

Development of theories of functionals 

A functional is a special kind of function. Its domain is a 
set of subsets of a space. Its range is a set of numbers. A typical 
example of a functional is a line integral. Our common meas- 
ures of length, area and volume are also functionals that assign 
a number to a set of points in Euclidean space. The study of 
physics introduces other functionals such as mass, electrical 
charge, etc. 

A special branch of mathematics, the calculus of variations, 
deals with the problem of maximizing or minimizing the value 
of a functional. It tells us, for example, that a plane figure 
whose perimeter has a given length has the maximum possible 
area if its perimeter is a circle. 


Classical Mathematics Grown 
Self-Conscious and Self-Critical 

The tumultuous growth of mathematics during a period of 
more than 2,000 years produced a great superstructure resting 
on a very shaky foundation. During the second half of the 
nineteenth century, mathematicians undertook a systematic 
analysis of the foundations of mathematics in order to plug the 
holes that had developed. We note some of the questions that 
they raised and answered. The answers to these questions have 
helped to give contemporary mathematics its characteristic 
flavor. 


What is a number? 

Weird numbers like the square root of 2 or the square root 
of —1 had been introduced as a convenience. What did they 
really mean, and why were they legitimately called numbers? 
The answer to this question was supplied by what we might call 
“Operation Bootstrap” — the constructive definition of larger 
and more adequate number systems with the help of the smaller 
and less adequate ones. The natural numbers are used to con- 
struct the integers, the integers are used to construct the ra- 
tionals, the rationals are used to construct the reals, and the 
reals are used to construct the complex number system. At each 
stage of the construction, it is proved as a theorem in the 
smaller system that the larger system that has been constructed 
really exists. This removes once and for all the mystery that 
lurked in the shadows of our earlier vague notions of the irra- 
tional and the imaginary. 
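
The first step of this construction, from the natural numbers to the integers, can be sketched in a few lines of Python. We assume here the standard definitions, in which an integer is a pair (a ~ b) of natural numbers, two pairs (a ~ b) and (c ~ d) are equal when a + d = b + c, and sums and products are formed as shown in the comments. The class name `PairInt` and the method name `same_as` are hypothetical labels chosen for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PairInt:
    """An integer modelled as a pair (a ~ b) of natural numbers."""
    a: int
    b: int

    def __post_init__(self):
        if self.a < 0 or self.b < 0:
            raise ValueError("both components must be natural numbers")

    def same_as(self, other):
        # Test for equality: (a ~ b) = (c ~ d) exactly when a + d = b + c.
        return self.a + other.b == self.b + other.a

    def __add__(self, other):
        # (a ~ b) + (c ~ d) = (a + c ~ b + d)
        return PairInt(self.a + other.a, self.b + other.b)

    def __mul__(self, other):
        # (a ~ b) . (c ~ d) = (ac + bd ~ ad + bc)
        return PairInt(self.a * other.a + self.b * other.b,
                       self.a * other.b + self.b * other.a)

    def __neg__(self):
        # -(a ~ b) = (b ~ a)
        return PairInt(self.b, self.a)
```

With these definitions the product of (0 ~ 1) with itself is (1 ~ 0), so the rule that a negative times a negative is positive appears as a consequence of arithmetic with natural numbers rather than as a separate decree.
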

What is a continuum? 

This question had plagued mathematics since the days of 
Zeno and his famous paradoxes. The answer was supplied by 
the Dedekind-Cantor theory of irrationals. Developing this 
theory required excursions into the theory of point sets. 

What is infinity? 

The concept of infinity crept into mathematics through many 
doors. It entered in the form of the infinite divisibility of the 
Euclidean line. It also entered in the form of the infinite ex- 
tensibility of the Euclidean line. It popped up again in infinite 
series and the definite integral. Riemann cleared up one con- 
fusion by making the distinction between unbounded and in- 
finite. A plane and a sphere, for example, are both unbounded, 
but while a plane is infinite, a sphere is finite. Cantor finally 
brought infinity under control by developing the theory of sets 
and of transfinite numbers. The theory of sets had its own trou- 
bles in the form of apparent contradictions. But these were 
ultimately eliminated by axiomatizing the theory, and by ex- 
cluding from consideration such unmanageable ideas as “the 
set of all sets.” 

What is a variable? 

For quite some time mathematicians relied on the rather 
meaningless answer that a variable was a number that changed 
while it was under discussion. A better and more meaningful 
answer came from the further development of the science of 
logic: A variable is merely a dummy that may be replaced by 
any element of a given set. Closely related to the concept of 
variable is that of an open sentence. An open sentence is a 
sort of printing press for printing a lot of statements that have 
the same form. A statement is produced by the open sentence 
by replacing each variable in it by some element selected from 
a specified set. 
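
The printing-press picture can be made concrete with a small Python sketch. The open sentence "x + 3 = 5" and the function names below are our own choices for illustration, not examples taken from the text.

```python
def print_statements(replacement_set):
    """Produce one statement of the form 'x + 3 = 5' per replacement for x."""
    return [f"{x} + 3 = 5" for x in replacement_set]


def truth_set(replacement_set):
    """Keep the replacements for which the resulting statement is true."""
    return [x for x in replacement_set if x + 3 == 5]
```

Over the replacement set 0, 1, 2, 3, 4, 5 the open sentence prints six statements, of which only the one obtained by replacing x with 2 is true.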

What is a function? 

In the happy days of the past, all the functions people ever 
had to use were represented by well-behaved analytic expres- 
sions. This led to very naive notions of what a function is, 
based on the use of formulas or graphs. It was assumed that 
every curve defined a continuous function and that every con- 
tinuous function had a derivative. But then pathological cases 
began to arise, such as a continuous function that had no deriv- 
ative or a curve that filled a square. The concept of function 
was finally clarified when it was separated from the concept of 
curve and was defined in its most general form on the basis 
of the theory of sets: A function is a mapping of one set of 
elements into another set of elements. Or alternatively it is 
defined as a particular kind of subset of the Cartesian product 
of the domain and range. An incidental result of the perfected 
concept of function is that many-valued functions have been 
banished from the realm of legitimate ideas. By definition, 
every function is single-valued. 
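
The set-theoretic definition lends itself to a direct test. The sketch below, in Python, treats a candidate function as a set of ordered pairs and checks that it is a subset of the Cartesian product of the domain and range in which every element of the domain appears as a first member exactly once. The name `is_function` is a hypothetical label used only for this illustration.

```python
def is_function(pairs, domain, codomain):
    """True when `pairs` assigns exactly one value in `codomain`
    to every element of `domain`."""
    pairs = set(pairs)
    # Every pair must lie in the Cartesian product domain x codomain.
    if any(x not in domain or y not in codomain for x, y in pairs):
        return False
    firsts = [x for x, _ in pairs]
    # Single-valued: no first member repeats. Total: every element is used.
    return len(firsts) == len(set(firsts)) and set(firsts) == set(domain)
```

The squaring relation on -2, -1, 0, 1, 2 passes the test, but the same pairs reversed do not, because 4 would be paired with both 2 and -2. This is the sense in which a "many-valued function" is not a function at all.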

What is an integral? 

The first integrals were integrals of continuous functions 
over a finite interval. It soon became necessary to extend the 
concept of integrability to make room for integrals of discon- 
tinuous functions, such as step functions, and for integrals over 
the infinite real line. Lebesgue raised the question of how far 
we can generalize the concept of integrability and of integral, 
and answered it by developing the theory of measure and the 
Lebesgue integral. Underlying the theory of measure is the concept 
of a measurable space defined by means of a σ-ring of 
subsets of the space (a ring closed under differences and 
denumerable unions). Measure theory now provides a rigorous 
foundation for the theory of probability. 

What is a geometry? 

Felix Klein answered this question with the help of the con- 
cept of a group of transformations. A given space can be 
mapped into itself in many ways. Of particular interest are 
those mappings called transformations that are one to one and 
have the entire space as range. The set of all such transforma- 
tions is a group. For each subgroup of this group it is possible 
to consider questions such as these: What figures in this space 
are mapped onto each other by the transformations of the sub- 
group? What properties of these figures remain unchanged 
when the figures are transformed? Klein proposed the fruitful 
idea that the study of such questions for any subgroup consti- 
tutes a geometry. Thus there is a geometry belonging to every 
one of these subgroups of the full transformation group. On 
the basis of this idea it is possible to set up hierarchies of ge- 
ometries. When the group of one geometry is a subgroup of the 
group of another geometry, the latter geometry is more general 
than the former. The theorems of the geometry determined by 
the larger group become theorems of the geometry determined 
by the smaller group. For example, the geometry of similarity 
in Euclidean geometry belongs to a particular group of transfor- 
mations of affine space. The geometry of congruences belongs 
to a subgroup of that group. The theorems about similar figures 
also apply to congruent figures. That is, congruence is a special 
case of similarity. One of the most beautiful results of this approach 
to the study of geometry was Klein’s demonstration of 
the existence of both Euclidean geometry and non-Euclidean 
geometry within projective geometry. 

Axiomatization of Euclidean geometry 

Euclid’s Elements undertook the task of developing geom- 
etry as a deductive system, in which all theorems are derived 
from explicitly stated postulates. Euclid did his work so well 
that the Elements became the model and the inspiration for 
other postulational systems, even outside the domain of math- 
ematics. For example, Newton wrote his Principia and Spinoza 
tried to write his Ethics as a sequence of theorems derived 
from explicitly stated assumptions and appropriate definitions. 
Nevertheless, the model itself is imperfect as a postulational 
structure. Euclid injected circular reasoning in the form of 
proofs by superposition, and he left a big gap in the structure 
when he failed to deal with relations of order. These defects 
were corrected in Hilbert’s axioms for Euclidean geometry. 
Many other ways of axiomatizing Euclidean geometry are pos- 
sible. Birkhoff has prepared a set of axioms that uses length of 
a line and angle measure as primitive notions. Prenowitz has 
prepared another set of axioms built around the notion of a 
convex set. The latter two sets of axioms have been offered as 
having the advantage of leading to a rigorous development of 
Euclidean geometry that is simple enough to be within the 
grasp of high-school students. 

I have described eight areas in which mathematics has sub- 
jected itself to searching self-criticism in order to build a rigor- 
ous foundation for its elaborate superstructure. If you reexam- 
ine what I have said about each of these eight areas you will 
find that I had to use the word “set” in order to describe what 
was done. All avenues of investigation of the foundations of 
mathematics converge toward set theory. In a systematic deductive 
development of mathematics, all constructions radiate 
out from set theory like the spokes of a wheel from a hub. Set- 
theoretic concepts are as necessary to mathematical discourse 
today as common nouns are to ordinary discourse. The analogy 
invoked by this remark, by the way, is not a superficial one, in 
view of the fact that every common noun defines a set. 


Modern Mathematics 

Modern mathematics is the direct result of the multiplicity 
of spaces and geometries and the multiplicity of algebraic 
structures developed by classical mathematics. Modern mathe- 
matics is to classical mathematics as elementary algebra is to 
elementary arithmetic. Elementary arithmetic deals with many 
numbers in the real number system. However, its statements 
are always assertions about relationships connecting specific 
numbers. Elementary algebra, on the other hand, by using 
variables, has a way of making assertions that are valid for 
many numbers or even for all numbers in the real number system. 
For example, whereas arithmetic may say 2 + 3 = 3 + 2, 
algebra will say x + y = y + x. Similarly, while classical 
mathematics deals with many different mathematical struc- 
tures, its typical approach is to study relationships in one struc- 
ture at a time. Modern mathematics, on the other hand, studies 
at one stroke the properties of all structures of a particular 
type. Thus, while classical mathematics may study the real 
number system, which happens to be an ordered field, modern 
mathematics will study ordered fields in general. Naturally, 
whatever is discovered about ordered fields in general will ap- 
ply to the real number system in particular. 

The purpose of modern mathematics, to deal with many 
structures at once, dictates the form that it takes. Modern 
mathematics is necessarily axiomatic, deductive, and abstract. 

It defines a type of structure, such as a field, as a set of elements 
and relations satisfying certain axioms. As in Euclid’s axio- 
matic treatment of geometry, it deduces theorems from the axi- 
oms. The treatment is abstract in the sense that no meanings 
are attached to the terms and relations used in the axioms other 
than those expressed by the axioms themselves. Thus, in a 
topological space, a point should not be understood as some- 
thing that can be represented by a dot on the blackboard. It 
is merely an element in a set which has special subsets known 
as open sets that satisfy certain axioms. The advantage of the 
abstract approach is that one abstract structure may have many 
concrete representations. By not being tied to one interpreta- 
tion of the terms, we are free to use many interpretations. Thus, 
there are topological spaces whose points are the points of ele- 
mentary geometry. There are topological spaces whose points 
are the lines of elementary geometry. There are topological 
spaces whose points are functions. Similarly, there are many 
possible interpretations for the points of a Euclidean plane to 
which the ideal points on the “line at infinity” have been added. 
They may be interpreted as the lines in a bundle of lines 
through a point in Euclidean three-space. They may also be in- 
terpreted as certain equivalence classes of triples of real num- 
bers. 

Some of the abstract structures of modern mathematics are 
referred to as algebraic structures. Others are referred to as 
geometric structures or spaces. What is the distinction between 
them? An algebraic structure is defined by means of operations 
analogous to addition and multiplication, satisfying certain 
axioms. Groups, for example, are defined in terms of one operation. 
Rings and fields are defined in terms of two operations. 
A space, however, is defined by means of certain distinguished 
subsets that satisfy particular axioms. Thus, a projective 
space is defined in terms of the subsets called lines, 
a topological space is defined in terms of the subsets called 
open sets, and a measurable space is defined in terms of subsets 
called measurable sets. 

At one time the geometry of Euclid was unique in being 
axiomatic and deductive. Now all mathematical structures are 
axiomatic and deductive. One of the arguments for teaching 
Euclid in the high schools used to be that it was the best vehicle 
for introducing young people to a deductive system. This argu- 
ment is no longer valid. To introduce young people to deduc- 
tive systems, we now have many deductive systems to choose 
from. We don’t have to wait for the tenth year to do it, and 
we don’t have to restrict deductive reasoning to the study of 
geometry. 

A characteristic of modern mathematics is its attempt to 
be as general as possible. A modern mathematician is not con- 
tent with proving a theorem by means of given axioms. As soon 
as he has proved the theorem, he tries to see how many axioms 
he can leave out and still prove the theorem. He tries to find 
out how much he can weaken his axioms and still be able to 
prove the theorem. In short, he is never happy with merely 
finding sufficient conditions for his theorem. He wants to know 
which conditions are also necessary. By proving a theorem 
with the weakest possible set of assumptions, he finds the 
broadest domain to which the theorem applies. 

The modern mathematician is concerned with whole struc- 
tures rather than individual members of the structure. His 
typical technique for studying a structure is to map one struc- 
ture into another. Naturally a central role is played by the 
structure-preserving mappings known in general as homomorphisms, 
and as isomorphisms if they are one to one and 
onto. Thus, the algebraist uses mappings that put sums into 
sums and products into products. The student of projective 
geometry uses mappings that put lines into lines (collinea- 
tions). The student of topology uses mappings that put open 
sets into open sets (open mappings) or, more frequently, map- 
pings for which the inverse image of an open set is an open set 
(continuous mappings). The student of measure theory uses 
measurable functions, for which the inverse image of a meas- 
urable set is a measurable set. 
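
To make the idea of a structure-preserving mapping concrete, here is a small illustrative sketch in Python (it is not part of the original text). It takes the mapping that sends each integer to its remainder on division by n, and checks on a finite sample that this mapping puts sums into sums and products into products. The function names and the choice n = 6 are assumptions made only for this illustration, and a check on a sample is of course not a proof.

```python
from itertools import product


def reduce_mod(n):
    """Return the mapping that sends an integer to its residue modulo n."""
    return lambda a: a % n


def is_ring_homomorphism_on(sample, n):
    """Check, on a finite sample of integers, that reduction modulo n
    puts sums into sums and products into products."""
    h = reduce_mod(n)
    for a, b in product(sample, repeat=2):
        if h(a + b) != (h(a) + h(b)) % n:   # sum of images, taken modulo n
            return False
        if h(a * b) != (h(a) * h(b)) % n:   # product of images, taken modulo n
            return False
    return True


h = reduce_mod(6)
print(is_ring_homomorphism_on(range(-20, 21), 6))  # structure is preserved
print(h(1) == h(7))                                # two integers, one image
```

Since 1 and 7 have the same image, the mapping is not one to one, so it is a homomorphism but not an isomorphism.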

The modern mathematician often uses a classical mathemat- 
ical structure as the starting point of his investigations. He ana- 
lyzes the structure to isolate from it certain separate qualities. 
He studies these qualities in abstraction in separate structures. 
Then he puts the structures together in various combinations. 
Thus he finds that the real number system may be viewed as a 
group, a ring, a field, an ordered set, a topological space, etc. 
So he studies abstractly groups, rings, fields, ordered sets, top- 
ological spaces, and so on. Then he studies such things as 
topological groups, topological rings, topological fields, or- 
dered fields, etc., in which two structures are combined into 
one. After he has found many qualities in a given structure, he 
is often particularly interested in finding out how many of these 
qualities he must put together to obtain the original structure. 
For example, the real number system has these properties: (1)
It is a number system in the sense that it has two operations 
called addition and multiplication that are associative and com- 
mutative and such that multiplication is distributive with re- 
spect to addition. (2) It is an extension of the rational number 
system. (3) It is ordered. (4) It has the property that any 
nonempty set in it which has an upper bound has a least upper 
bound. The interesting thing about these properties is that they
characterize the real number system. Any system that has these
four properties must be the real number system (that is, is iso- 
morphic to it). 

Since we have stressed the advantages of the formal deduc- 
tive approach of modern mathematics, it is necessary to say 
something about its limitations. Not all mathematics is formal 
and deductive. In the first place, mathematical discovery is not 
deductive. The research mathematician who gropes his way 
toward new theorems is guided by analogy, by hunches, by 
trial and error, and by flashes of intuitive insight. It is only after 
he has made his discoveries that he uses hindsight and shows 
how he could have arrived at his conclusions most economi- 
cally by deductive reasoning. 

Secondly, a deductive system is incapable of supplying the 
justification for its acceptance as a legitimate mathematical 
system. A mathematical system is legitimate only if it is con- 
sistent, that is, free of contradiction. A proof of the consistency 
of a system may be developed as a theorem of metamathe- 
matics (reasoning about the mathematical system, but not in 
that system). But the validity of the proof depends on assum- 
ing the consistency of the system of axioms underlying the meta- 
mathematical argument. In the case of a system that is inclu- 
sive enough to contain the natural numbers of arithmetic, this 
assumption means taking more for granted than does the as- 
sumption that arithmetic itself is consistent. This fact follows 
from Gödel’s theorem that such a consistency proof requires
rules of inference that are even stronger than those of arith- 
metic. Thus, the problem of proving the consistency of arith- 
metic is not solved, but merely shifted to other ground. For 
this reason it is common practice to follow two procedures to 
establish the consistency of a deductive system. One procedure 
is to prove only relative consistency. For example, it is possible
to prove that non-Euclidean geometry is consistent if Euclidean
geometry is consistent. The second procedure is to prove the 
absolute consistency of a system by producing a concrete rep- 
resentation of it. But this, of course, involves an appeal to in- 
tuition. In constructing mathematical systems we cannot ex- 
clude intuition entirely. We can only restrict the area in which 
we must lean on intuition. 

Even if we could do without intuition in the creation of 
mathematics, it would be folly to try to do without it in the 
teaching of mathematics. I am sure that the reader must have 
had the experience I have often had of following a deductive 
argument step by step and ending up not knowing what it was 
that had been proved. While a deductive argument may show 
us where one tree stands in relation to another, an intuitive ar- 
gument is often the best way of seeing the woods that are made 
up of all those trees. 

While we stress the limitations of deductive argument, let us 
not forget, meanwhile, the limitations of intuition. Intuition is 
a useful guide, but sometimes an unreliable one. Let us remember
that intuition misled some eighteenth-century mathematicians into
saying that the infinite series 1 − 1 + 1 − 1 + . . . has a sum
equal to 1/2. What is “discovered” by
intuition has to be checked by rigorous deductive argument. 
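
A short computation shows why the claim about this series needs checking. The sketch below is an illustration in Python, not part of the original text, and its function names are chosen only for this example. It lists the partial sums of 1 − 1 + 1 − 1 + . . ., which jump back and forth between 1 and 0 and so approach no limit. It also computes the running averages of those partial sums. These averages do approach 1/2, which suggests where the intuitive value came from, but an average of partial sums is not a sum in the ordinary sense.

```python
from fractions import Fraction


def partial_sums(count):
    """Partial sums s_1, ..., s_count of the series 1 - 1 + 1 - 1 + ..."""
    sums, total = [], 0
    for k in range(count):
        total += 1 if k % 2 == 0 else -1   # terms alternate +1, -1
        sums.append(total)
    return sums


def running_averages(sums):
    """Averages (s_1 + ... + s_n) / n of the partial sums, as exact fractions."""
    averages, running = [], 0
    for n, s in enumerate(sums, start=1):
        running += s
        averages.append(Fraction(running, n))
    return averages


sums = partial_sums(8)
print(sums)                        # the partial sums keep oscillating
print(running_averages(sums)[-1])  # the averages settle near 1/2
```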

Mathematics in the form of an axiomatic deductive system 
has sometimes been described as a game. The mathematician, it 
is said, makes up the rules of the game in the form of axioms. 
Then he proceeds to play the game according to the rules. 
Some people have gone beyond this assertion to say that mathe- 
matics is only a game, played without regard to any possible 
applications. This assertion is not correct. It is true that the 
pure mathematician should be free to explore in any direction 
in which his curiosity carries him. But his choice of the rules
of his game is not entirely arbitrary. He tries to select rules
that he and his colleagues will judge to be significant. And the 
measure of significance is usually the extent to which they re- 
late to existing mathematical structures and to practical appli- 
cations. 

Pure mathematics has benefited greatly by growing up in 
intimate contact with practical applications. To see the truth 
of this statement we need only recall how the theory of elliptic 
integrals and functions grew out of the pendulum problem, how 
Fourier series grew out of the study of heat, how the study of 
Riemannian geometry was stimulated by relativity theory, and 
how the study of Hilbert spaces was encouraged by quantum 
mechanics. The greatest mathematicians have always com- 
bined dedication to pure mathematics with a strong interest in 
its applications. This is shown, for example, in the work of 
Gauss, Klein, Poincaré, Weyl, and von Neumann, to mention
only a few. 

The growth of modern mathematics has led to increased 
specialization. Specialization leads to a fragmentation by sub- 
division of subject matter. At the same time, however, there 
is a growing unification of method. Set-theoretic methods per- 
meate all of modern mathematics. Algebraic methods reach out 
into topology. Topological methods are used to deal with prob- 
lems in analysis, etc. For example, in algebraic topology, es- 
sentially geometric configurations are studied by the examina- 
tion of associated groups. The typical technique is to set up 
a sequence of groups such that each group is mapped homo- 
morphically into the next one in the sequence. Now this alge- 
braic technique, developed originally for the study of topology, 
has been generalized and axiomatized in homological algebra 
and has become a technique of abstract algebra. Another interesting
example of the trend toward unification is found in
the way an existence proof in the theory of differential equations
can be derived from a fixed-point theorem in topology.

There is one fact implicit in what I have said about modern 
mathematics that should be stated explicitly. Modern mathe- 
matics does not replace classical mathematics. It generalizes 
it, supplements it, unifies it, and deepens our understanding 
of it. But classical mathematics in the form of arithmetic, anal- 
ysis, and geometry is as important as it ever was. 

I would like to comment at this point on some aspects of 
the strategy of contemporary mathematics that are important 
even though they are not at all new. There are some typical 
methods of the mathematician that he uses in classical mathe- 
matics as well as modern mathematics. They are worth noting 
here because they are used in elementary mathematics as well 
as in advanced mathematics. 

1. After defining a structure or a configuration of which 
there may be many examples, the mathematician frequently 
sets himself the task of identifying and classifying all the pos- 
sible examples. For instance, algebraists are trying to identify 
and classify all finite groups. They have solved the classifica- 
tion problem for finite commutative groups. They have only 
scratched the surface of the problem for finite groups in gen- 
eral. Classification is often accomplished by picking out a set 
of simple cases and then expressing all other cases as combina- 
tions of the simple cases. For example, finite commutative 
groups are factored into products of cyclic groups. In arith- 
metic, integers are expressed as products of primes. In plane 
geometry, polygons are decomposed into triangles. 

2. Often classification is accomplished by dividing the set
of things being studied into equivalence classes, and then showing
that each equivalence class contains one and only one element
of a special type. This is the technique of reducing to
canonical form. In elementary algebra, for example, we reduce
polynomials to the canonical form a_0 x^n + . . . + a_n. In arithmetic
we reduce fractions to the canonical form called “lowest
terms.”

3. Sometimes we pick out a set of simple cases and show that 
although other cases cannot be reduced to these simple cases, 
at least they can be approximated by them. Approximation 
theorems occur in many branches of mathematics. In the the- 
ory of numbers, for example, there is a theorem that every real 
number can be approximated as closely as we please by a ra- 
tional number whose denominator is not too big. Specifically, 
it says that if a is real and g is a positive integer, we can find
integers x, y such that |x − ay| < 1/g, with 1 ≤ y ≤ g. In the
theory of functions of a real variable we have the Weierstrass
theorem that any continuous function on the closed segment 
from 0 to 1 can be approximated by polynomials. In algebraic 
topology, compact metric spaces are approximated by finite 
polyhedra. In elementary calculus and in plane geometry we 
often approximate a rectifiable curve by an inscribed polygon. 

4. Sometimes we study a structure by seeing what happens 
when we map it into one of a set of simple and well-known 
examples of that structure. For example, a group is often stud- 
ied by mapping it into a matrix group. A ring may be studied 
by mapping it into the ring of endomorphisms of a group 
(homomorphisms of the group into itself). If we map each 
polynomial with real coefficients into its residue class modulo 
x^2 + 1, we have a homomorphism that can be used to prove
that the complex number system exists, that it is a field, that 
it contains the real number system as a subfield, and, of course, 
that it contains a square root of −1.

5. We have just described four ways in which the simplest
cases of a structure or configuration are used to study the gen- 
eral cases. There is one technique which uses the opposite ap- 
proach. It studies the general cases by paying particular at- 
tention to the exceptional, pathological case. It is analogous 
to the way in which psychologists learn about normal behavior 
by studying neuroses and psychoses. In the theory of functions 
of a complex variable, for example, we encounter the singular 
points of a function. These are the points where the function 
does not behave itself at all, and in fact, may not even be de- 
fined. Far from ignoring the singular points, we pay particular 
attention to them, because they have a great influence on the 
behavior of the function at the points near the singularity. 
The function behaves in one way near a pole, and in a differ- 
ent way near an essential singularity. Similarly, in the theory 
of systems of ordinary linear differential equations, we find it 
to our advantage to examine the critical point of the system 
(the place where all components of the derivative vanish), 
because the nature of the solutions to the system of equations 
depends on what happens in the neighborhood of the critical 
point. In elementary calculus we use a similar approach to 
help us do curve tracing by first identifying the pathological 
cases: the discontinuities, the maxima and minima, and the 
points of inflection. 

6. An interesting feature of many branches of mathematics 
is the existence of duality. Wherever duality occurs, a theorem 
can immediately be translated into a dual theorem without any 
further proof. In projective plane geometry, point and line are 
dual to each other. In projective three-space, point and plane 
are dual to each other. In linear algebra, a vector space over a 
scalar field and the set of linear mappings of the space into the 
scalar field are dual to each other. In elementary plane geometry,
there is an imperfect duality of point and line, and a related
duality of side and angle in a polygon. In spherical geometry,
there is the duality of great circle and pole. Conscious use
of duality, where it exists, makes possible the discovery of new 
relationships and affords a deeper insight into the relationships 
that hold a structure together. 
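
Two of the techniques listed above can be tried out by direct computation. The following is a minimal sketch in Python, not part of the original text, and every function name in it is an assumption made for illustration. The first function searches for the integers x, y promised by the approximation theorem quoted under item 3: given a and a positive integer g, it looks for 1 ≤ y ≤ g with |x − ay| < 1/g. An exact fraction stands in for the real number a so that the arithmetic is exact. The remaining functions follow item 4: a polynomial with integer coefficients is sent to its residue class modulo x^2 + 1, written as a pair (a, b) standing for a + bx, and the classes are multiplied by the rule that x^2 may be replaced by −1.

```python
from fractions import Fraction


def dirichlet_approximation(a, g):
    """Search for integers x, y with 1 <= y <= g and |x - a*y| < 1/g."""
    bound = Fraction(1, g)
    for y in range(1, g + 1):
        x = round(a * y)                 # nearest integer to a*y
        if abs(x - a * y) < bound:
            return x, y
    return None  # by the theorem, not reached when a is given exactly


def residue(coeffs):
    """Residue class modulo x^2 + 1 of c0 + c1*x + c2*x^2 + ...,
    returned as the pair (a, b) standing for a + b*x."""
    a = b = 0
    for k, c in enumerate(coeffs):
        sign = -1 if (k // 2) % 2 else 1   # x^2 = -1, x^4 = 1, and so on
        if k % 2 == 0:
            a += sign * c
        else:
            b += sign * c
    return a, b


def multiply_classes(p, q):
    """Product of the classes a + b*x and c + d*x, using x^2 = -1."""
    (a, b), (c, d) = p, q
    return a * c - b * d, a * d + b * c


def poly_mul(p, q):
    """Ordinary product of two polynomials given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out


# 1.4142 approximated with denominator at most 10
print(dirichlet_approximation(Fraction(14142, 10000), 10))  # (7, 5)

# The class of x is a square root of -1
i = residue([0, 1])
print(multiply_classes(i, i))  # (-1, 0)

# The mapping puts products into products on this example
p, q = [1, 2, 3], [0, 1, 0, 4]
print(residue(poly_mul(p, q)) == multiply_classes(residue(p), residue(q)))
```

The first result says that 7/5 is within 1/(10 · 5) of 1.4142. The second shows the class of x behaving exactly as the complex number i does, which is the point of the construction described under item 4.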


Mathematics More and More 
Intimately Related to Man's Activities 

While mathematics, in the form of pure mathematics, has 
reached dizzying heights of abstraction, it has kept its feet on 
the ground by multiplying and extending its applications. 

Now more than ever, it is true that mathematics is the hand- 
maiden of the sciences. Before this century the science of phys- 
ics had already made abundant use of mathematics in mechan- 
ics, optics, the theory of heat, and electromagnetic theory. 
Analysis used to be the chief mathematical tool of the physicist. 
Now, with the development of relativity theory and quantum 
mechanics, he has had to learn Riemannian geometry and 
modern algebra. The increasing role that chance processes play 
in physical theory compels him to learn probability theory as 
well. 

Mathematics has spilled over from physics into the other 
physical sciences, chemistry and geology. It has invaded the 
life sciences, biology and psychology, and has expanded into 
the social sciences too. There is no area of science today that 
can avoid using mathematical methods. 

With the growing complexity of industrial and business life, 
industry and commerce have raised more and more questions 
that can be answered adequately only by the use of mathematical
methods. Insurance and pension systems use actuarial
mathematics. Industries undertaking quality control use statistical
methods. Commercial establishments choose from
among alternative courses of action with the help of linear 
programming. Military strategists plan their moves with the 
help of the theory of games. Communications engineers use in- 
formation theory and Boolean algebra. 

To deal with complicated problems at high speed, automatic 
electronic computers have been constructed. Mathematicians 
share in the designing of the computers and in setting up their 
programs. Because of the widespread use of computers, loga- 
rithms have lost their importance as a computational tool. 

To meet the new demands made by science, industry and 
government, mathematics has had to grow in new directions. 
There is a vigorous development of the theory of differential 
equations and methods of solving them by means of computers. 
There has been a tremendous growth in the theories of proba- 
bility and statistics. New areas of study, such as linear program- 
ming, the theory of games, and information theory, have 
sprung up overnight. 

The spread of automation in industry is changing the char- 
acter of the labor force. The tendency is to reduce the amount 
of unskilled and semiskilled labor that is used, and to increase 
the need for technically trained personnel with a knowledge of 
mathematics. The importance of mathematics in vocational 
training will continue to grow in the future. 

The production of food, clothing and shelter has never 
been man’s sole concern. He has always found it necessary to 
ponder over deep questions about man’s relation to man and 
about his place in the universe. Even these questions have now 
taken on a mathematical character. I shall refer to only two 
obvious examples. For thousands of years philosophers have 
had endless debates about the meaning of infinity. Now it is
impossible to talk sense on this subject without taking into account
what mathematics has contributed through the theory of
the continuum and of transfinite numbers. In the past, philoso- 
phers have speculated about the nature of space. Is it finite 
or infinite? Is it bounded or unbounded? Now it is impossible 
to talk sense about these questions without taking into account 
what Riemannian geometry has contributed through the theory 
of relativity. Mathematics is increasingly important as an es- 
sential ingredient of a liberal education. 
(continued from front flap)
electrical engineer describes an alternating current. In this way he
shows the familiar origin of such 
unfamiliar concepts and terms of the 
new mathematics as groups, rings, 
fields, vector spaces, algebras, 
homomorphisms, isomorphisms, and 
homeomorphisms. 

As he reads, and takes pencil and 
paper to work the “Do-It-Yourself” 
sections, the reader will become aware 
that he is merely nibbling at the corner 
of a great rug that has a beautiful but 
intricate design woven into it. If what
he sees from the corner arouses his
curiosity about the central design, this 
book will prepare him to move toward it 
through systematic study of advanced 
texts. 


This is a book for the layman who 
has had elementary algebra and plane 
geometry and who wants to enter a 
fascinating new world of knowledge. It 
will be particularly valuable to the high 
school teacher who wants to enrich the 
curriculum, gain new insights into old 
structures, and keep pace with the most 
modern standards of mathematics 
teaching. 